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Abstract. The concentration of measure prenomenon roughly states that, if a set 
A in a product il,^ of probability spaces has measure at least one half, "most" of 
the points of are "close" to A. We proceed to a systematic exploration of this 
phenomenon. The meaning of the word "most" is made rigorous by isoperimetric- 
type inequalities that bound the measure of the exceptional sets. The meaning of 
the work "close" is defined in three main ways, each of them giving rise to related, 
but different inequalities. The inequalities are all proved through a common scheme 
of proof. Remarkably, this simple approach not only yields qualitatively optimal 
results, but, in many cases, captures near optimal numerical constants. A large 
number of applications are given, in particular in Percolation, Geometric Probability, 
Probability in Banach Spaces, to demonstrate in concrete situations the extremely 
wide range of application of the abstract tools. 
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1. Introduction 

Upon reading ttie words "isoperimetric inequality" the average reader is likely 
to think to the classical statement: 

(1.1) Upon the bodies of a given volume in M^, the ball is the one with the 

smallest surface 

This formulation, that needs the notion of surface area, is not very appropriate 
for generalization in abstract setting. A less known (equivalent) formulation is as 
follows: 

(1.2) Among the bodies A of a given volume in M^, the one for which the set 

of points within Euclidean distance t of A has minimum volume is the Euclidean 
ball. 

It should be intuitive, taking t — > 0, that (1.2) implies (1.1). We will, however, 
rather be interested in large values of t. A first sight, this is uninteresting; but this 
first impression is created only by our deficient intuition, that functions correctly 
only for N < 3, and lamely fails for the large values of N that are of interest here. 

For our point of view, the main feature of (1.2) is that it gives a lower bound on 
the volume of At that depends only on t and the volume of A. 

From now on, all the measures considered will be probabilities (i.e. of total mass 
one). The concept of concentration of measure phenomenon largely arose through 
the work of V. Milman on Dvoretzky's theorem on almost Euclidean sections of 
convex bodies. Following [G-M], [Mi-S], the basic ideas may be described in the 
following way. Consider a (Polish) metric space {X,d). For a subset A oi X, 
consider the d-ball At centered on A, i.e. 

(1.3) At = {xeX:d{x,A)<t}. 

Consider now a Borel probability measure P on X. The concentration function 
a{P,t) is defined as 

a{P,t) ^sup{l- P (At): P (A) A G X , A Borel} . 

In other words 

(1.4) ^(^)^^ ^ P{At)>l-a{P,t). 

It turns out that in many situations the function a{P,t) becomes extremely small 
when t grows. In rough words, if one starts with any set A of measure > 1/2, 
At is almost the entire space. This is the concentration of measure phenomenon, 
that was promoted most vigorously by V. Milman. It plays an important role in 
local theory of Banach spaces, and has become the central concept of the area 
of probability known as Probability in Banach spaces. (See the book [L-T2], and 
subsequent work such as [T6] , [T7] ) . 

A prime example of space where concentration of measure holds is the Euclidean 
sphere Sn oi equipped with its geodesic distance d and normalized Haar 

measure Pn, for which it can be shown that 

(1.5) a(P.,t)<Q"\.,[-i^e). 
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Closely related, and more in line with the topic of the present paper is the case 
X = R^, equipped with the Euclidean distance and the canonical Gaussian measure 
7iv (whose covariance is the Euclidean dot product). In that case 

/OO -1 -1 

It should be pointed out that more is known. The Gaussian isoperimetric in- 
equality states that 

(1.7) 7iv(A) = 7i((-oo, a]) =^ 7iv(^t) > 7i((-oo, a + t]) 

which implies (1.6) when a = 0. However, it is sufficient for many applications to 
know (1.6) or even the weaker inequality 

(1.8) «(7Ar,t)<i^e-*'/-^ 

where is a universal constant. 

In the present work we perform a systematic investigation of the concentration 
of measure phenomenon in product spaces. Thus with the terminology above, X 
will be a product of probability spaces, and P a product measure. The statements 
will have the form (1.4). However, the set At, which consists of points close in a 
certain sense to A, (and that, for convenience, we will call the t-fattening of A) will 
not always have the form (1.3). Indeed, it turns out that it is extremely fruitful to 
consider various notions of fattening. We will define three rather distinct notions 
of fattening. These notions are studied respectively in Chapters 2 to 4. Each 
of these notions can be studied with various level of sophistication, and they are 
at times closely connected. Discussing the whole theory in this introduction would 
require too much repetition and is inappropriate for an article of the present length. 
Thereby, we have decided to mention here only the main new theme (that did not 
appear in this author's previous work) as well as a simple result that appears to 
have a considerable potential for applications. 

Assume that X = is a product of probability spaces, and that P = /i^ is a 
product probability. We recall that the Hamming distance cZ on X is given by 

(1.9) d{x, y) — card{i < N:xi ^ i/i} . 



When Af is given by (1.3), where d is the Hamming distance, an important 
result, proved in special cases in [Mi-S] (with a proof that extends verbatim to the 
general situation) is that the concentration function a{P,t) satisfies 

(1.10) a(P,t)<Kexp 

where K is a universal constant. 

One could interpret (1.9) by saying that we put a penalty 1 for each coordinate i 
where Xi ^ yi. One recurring theme of the present paper is the investigation of what 
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happens when, instead, we put a penalty h{xi,yi), where h{x,y) is a non- negative 
function on A striking and unexpected finding is that in several instances there 
is a high disymmetry between the roles of x and y. For example, in one of the main 
results of the paper (Theorem 4.4.1) if one requires that h{x,y) should depend 
on X only, it has to be bounded; but, if it depends on y only, weak integr ability 
conditions suffice. 

Suppose now that {ai)i<N are positive numbers, and let us replace the distance 
(1.9) by 

da{x,y) - Yl ^^^{.^ii^vi) ■ 

i<N 

It is then shown in [Mi-S] that (1-10) can be extended into 
(1.11) a{P,t)<Kexp 



One way to spell out this result is as follows: 

Given A C fl^ , with P{A) > |, then, for all numbers {ai)i<N, ctj > 0, ^ af 
1, we have 



i<N 



(1.12) p(A,,^)>l_Kexp(-- 



where 



> . 



At,a = <x en^:3y e A, ^ < t 

[ i<N 

The first result of Chapter 4 states that (1.12) can be improved into 
(1-14) p(n'^t,o}j >l-Kexp(^-^^ 

where the intersection is over all families a = {ai)i<N as above. The power of this 
principle (that will be considerably perfected in Chapter 4) is by no means obvious 
at first sight, but will be demonstrated repeatedly through Chapters 6 to 9 (the 
easiest applications being in Chapter 6 and 7). 

We have explained in terms of sets what is the concentration of measure phe- 
nomenon. However, rather than sets, one is more often interested in functions. In 
that case, the concentration of measure phenomenon takes the following form: if a 
function / on X is sufficiently regular, it is very concentrated around its median 
(hence around its mean). If Mf is a median of /, this is expressed by a (fast de- 
creasing) bound on P(|/ — M/| > t). For a simple example, (1.4) implies that if / 
has a Lipschitz constant 1 with respect to the underlying distance 

(1.15) P{\f-Mf\>t)<2a{P,t). 
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Despite the fact that functions are potentiaUy more important than sets, all 
our concentration of measure results are stated in terms of sets. (This is done in 
Part I.) The essential reason for this choice is that the power and the generality of 
these results largely arise from the fact that they require only minimal structure 
(a condition better achieved by considering sets only). A secondary reason is that 
much of the progress reported on the present paper (including on some rather 
concrete questions presented in Part II) has been permitted, or at least helped 
by the abstract point of view; and thereby, it seems worthwhile to promote this 
approach. Nevertheless, the natural domain of application of the tools of Part I 
is the obtention of bounds on P(|/ — M/| > t) when / is a function defined on a 
product of measure spaces. We will, however, give no abstract statement of this 
type. We prefer instead to analyze a number of specific situations, reducing each 
time to statements about sets (the great variety of situations encountered indicates 
that this is possibly a clever choice). This is the purpose of Part II, where we will 
demonstrate the efiiciency of the tools of Part I. It must be said that these specific 
situations have been of considerable help in pointing out the directions in which 
the abstract theory should be developed. Most of the abstract results are indeed 
directly motivated by applications. 

Certainly there is a considerable number of situations where occur naturally 
functions that are defined on a product of many measure spaces, or equivalently 
that depend on many independent random variables. The examples presented here 
are certainly infiuenced by the past interests of the author. Their boundary, how- 
ever, is likely to refiect the limited knowledge of this author rather than the limit of 
the power of abstract tools of Part I. (Should a reader be aware of another potential 
domain of application, he is urged to introduce it to this author.) Quite logically, 
several of the examples we present have an "applied" fiavor. This is simply be- 
cause stochastic models occur in physics (such as Percolation and spin glasses) and 
Computer Science (bin packing, assignment problem, geometric probability). The 
reason for the later is that these stochastic models do shed some light on the behav- 
ior of computationally intractable problems, and, for this reason, are widely studied 
today; see e.g., [C-L], [ ]. No previous knowledge whatsoever of these problems is 
required for reading the material of Part II, that we briefiy describe now. 

Each of the examples of Part II studies the deviation of a specific function / of 
many independent random variables from its mean. In each example, the function 
/ is obtained as the solution of an optimization problem. This is not a coincidence, 
but rather reflects the fact that such situations are well adapted to the use of our 
methods. In Chapter 6, we apply (4.1.3) to stochastic bin packing. This simple 
application is presented first since it is while considering this problem (while proc- 
toring a Calculus exam) that the power of (4.1.3) beyond Probabilities in Banach 
spaces was first realized. The application is not really typical. More typical is the 
application of Chapter 7, to the length of the longest increasing subsequence of a 
random permutation. This application puts forward the fact that when one studies 
the size of substructures whose existence is determined by a comparatively small 
number of random variables, rather than by the whole collection of random vari- 
ables, inequality (4.1.3) fully takes advantage of that feature. This characteristic 
occurs again in Chapter 8, where it is presented as a general result, that allows, 
as a rather weak and special corollary, to improve upon H. Kesten's recent results 
on first time passage in Percolation [K2]. In Chapter 9, we show how (4.1.3) again 
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provides a natural approach to questions on random graphs. The chaUenge of the 
Assignment problem considered in Section 10 is that the objective function / con- 
sidered there is very small; it is of order one, while depending on N'^ independent 
variables of order one, each of them with a potentially disastrous influence on the 
objective function. In Chapter 11, we consider situations where the objective func- 
tion / is defined in a geometrical manner from a random set of N points in the unit 
square. The common objective is to prove that / has Gaussian-like tails. However, 
the richness of the situation is unsuspected beforehand; apparently similar defini- 
tions require rather different levels of sophistication. In Chapter 12, wc provide a 
simple derivation of the free energy of spin glasses at high temperature. Finally, in 
Chapter 13, we discuss how the study of sums of vector- valued independent random 
variables motivated the approach of this paper, and we discuss a few new specific 
results. 

We now comment on the methods of Part I, their history, and compare them 
with competing methods. 

There is a general method, that is becoming increasingly popular, to prove devi- 
ation inequalities for \f — Ef\. (That the mean rather than the median is involved 
is very much irrelevant). It is to decompose / as the sum of a martingale dif- 
ference sequence / = "^di, and to use martingale inequalities. The generality of 
the method stems from the fact that such a decomposition is easy, simply writing 
di = E{f I J^i) — E{F I J^i-i) for any increasing filtration (J^i). This method was 
used in Probability in Banach Spaces (under the name of "Yurinski's method") 



for the study of / = 



where Xi are independent Banach space random 



i<N 

variables (r.v.). The generality of the method was discovered by B. Maurey [Ml], 
and it was further developed in [Mi-S]. It soon became apparent, however, that this 
method would not always yield optimal results; this is what prompted the invention 
of the isoperimetric inequality of [T2] (more details on history are given in Chapter 
12). An inequality very similar to the inequality of [T2], but with a much simpler 
proof, appears in the present paper as Theorem 3.1.1. The phenomenon described 
by this inequality was completely new at that time, and had a major impact in 
Probability in Banach spaces (prompting, in particular, the writing of the book 
[L-T2]). One could reasonably hope that this inequality would find applications 
to other domains; but as of today, this has not been the case. Another inequality 
that was discovered in relation with Probability in a Banach space is a predeces- 
sor of (4.1.3) [Tl]. The inequality of [Tl] did not, however, play a crucial role in 
that theory, because, for most applications, it could be replaced by the Gaussian 
isoperimetric inequality (1.6) to which it is related. For this reason, the discov- 
ery that (4.1.3) was the direction to pursue for applications outside Probability in 
Banach spaces was delayed until very recently. It does not seem possible to prove 
either (4.1.3), or even some of its most interesting consequences we will present in 
Part II through the martingale method. This should not be so surprising, since 
the inequalities of the present paper have been developed precisely to achieve what 
martingales seem unable to attain. Among the results of Chapters 2 to 5, appar- 
ently only those of Sections 2.1, 2.2 can be obtained using martingales; and the only 
reason why these are included here is that they provide an excellent and very simple 
setting to introduce our basic scheme of proof. A major thesis of the present paper 
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is that, while in principle the martingale method has a wider range of applications, 
in many situations the abstract inequalities of Part I are not only more powerful, 
but require considerably less ingenuity to apply. In all the examples we examined, 

only in some rare situations, where the martingale is close to a sum of independent 
r.v., and where the value of numerical constants is crucial (such as [M-H]) did our 
methods fail to supersede martingales. 

We now comment on the method of proof of the inequalities of Part I. Isoperi- 
metric inequalities such as (1.5) or (1.7) are often proved via rearrangements. That 
is, one produces a (simple if possible) way to transform the set ^ in a set T(A), of 
the same measure, but more regular, so that the measure of T{A)t is not more than 
the measure of Af. The procedure is then iterated, in a way that the iterates of A 
converge to the "extremal case" . Rearrangements are the only known technique to 
obtain perfect inequalities such as (1.5), (1.6). The inequality of [T2], that started 
the present line of work was proved using rearrangements. The difficult proof re- 
quires different types of transformations, some of which prevent from obtaining the 
external sets. 

Despite considerable efforts, rearrangements did not yield a proof of the inequal- 
ity of [Tl]. (As pointed out to me by N. Alon, the reason could be the complicated 
nature of the extremal sets.) A completely new method was developed in [T2]. 
The main discovery there was that of a formulation that allows an easy proof by 
induction upon the number of coordinates. The wide applicability of the method 
became apparent only gradually. This method and its variations provide a unified 
scheme of proof of all our inequalities, that, in its simplest occurrence, is described 
in great detail in Section 2.1. Ironically enough, this method is, in its principle, 
rather similar to the martingale method; the extra power is gained from the possi- 
bility of abstract manipulations in product spaces. A considerable advantage of the 
method is that, proving the induction hypothesis reduces to proving certain state- 
ments involving only functions on fl. At times this is extremely easy; sometimes it 
is a bit harder. But certainly the nature of the statements that have to be decided 
is such that they are bound to yield to sufficient effort. What on the other hand, is 
not entirely clear, is why this simple procedure seems so miraculously sharp; in the 
situations where explicit computations of the best possible constants given by the 
method has been possible, these constants have proved very close to the optimal. 
In the cases where only less precise estimates have been possible, these estimates 
appear nonetheless to capture, up to a constant, the exact order of what really 
happens, and this, in every single situation that has been investigated. 

The paper has been written to be read without any knowledge of this author's 
previous work or of the topic in general. For the sake of completeness, the only 
previous result of the author that has not been either vastly generalized or con- 
siderably simplified has been reproduced (as Theorem 4.2.4). Significant effort has 
been made in writing the paper in an easily accessible form. For example, it turns 
out in several situations that the simplest occurrence of a new principle is also the 
most frequently used. In these cases, we have taken care to give a separate proof 
for this most important case. These (short) proofs also serve as an introduction to 
the more complicated proofs of subsequent more specialized results. 

During the preparation of this paper, I asked a number of people whether they 
were aware of recent or potential uses of the martingale method. I am pleased to 
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2. Control by one point 
2.1. The basic principle 

Throughout the paper we will consider a probability space (O, E, fi) and the 
product {fl^ , A*^)- The product probability will be denoted simply by P. 

Consider a subset A of fl^ . For x e fl^ , we measure how far x is from A by 

(2.1.1) f{A, x) = min{card{i < N ; Xi ^ yi} ; y e A} . 

This is simply the Hamming distance from x to A. The reason that we use a different 
notation is that at later stages, we will introduce different ways to measure how far 
X is from A. These ways will not necessarily arise from a distance. 

It should be observed that the function f{A,-) need not be measurable even 
when A is measurable. This is the reason for the upper integral and outer proba- 
bility in Proposition 2.1.1. below. On the other hand, measurability questions are 
simply irrelevant in the study of inequalities. Simple and standard approximation 
arguments show that none of the results of this paper would lose any power if one 
should assume that O, is Polish, is a Borel measure, and that one studies only 
compact sets. It would be distracting to devote space and energy to these routine 
considerations. Therefore, we have felt that it would be better to simply ignore all 
measurability questions, and treat all sets and functions as if they were measurable. 
The reader will keep in mind that in the sequel, when measurability problems do 
arise, certain integrals (resp. probabilities) have to be replaced by upper integrals 
(resp. outer probabilities) just as in the statement of Proposition 2.1.1. (The reader 
who desires to have a proof of our statements without measurability assumption 
should be warned that it does not work to try to extend the proofs we give by 
putting outer integrals rather integrals - the reason being that Fubini theorem fails 
for outer integrals. Rather one has to derive the general result from the special case 
of well behaved sets by approximation.) 

Proposition 2.1.1. Fort > 0, we have 

/* 1 / 1 i _L ~t \ ^ 

e«^.„P(.)<_J_(l + l±^) 



In particular, 

(2.1.3) i^*({/(A-)>^})<p^e-'=Viv. 

As was pointed out in the introduction, the power of our approach largely rests 
upon the fact that it reduces the proof of an inequality in fl^ such as (2.1.2) to the 
proof of a much simpler fact about functions on Q. In the present case, the meat 
of Proposition 2.1.1 is as follows. 
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Lemma 2.1.2. Consider a (measurable) function g on Q. Assume < g < 1. 
Then we have 

(2.1.4) / min ( e*, — ^ ) dfi{iv) I g{uj)dii{u}) < a{t) 

where we have set a{t) = ^| + — ) • 

Proof. If we replace g by max((7, e~*), this does not change the first integral, but 
increases the second. Thus it suflaces to prove that if e~* < 5^ < 1, we have 

1 



d/j, / gd/j, < a{t) 
Q. 9 Jo. 



Consider the convex set C of measurable functions g orvQ for which e~* < (7 < 1- 
On C, the functional g — * j^g~^dfi is convex. On the subset Cb of C that consists 
of the functions with integral b, this functional attains its maximum on an extreme 
point. There is no loss of generality to assume that /j, has no atoms; then it is 
well known that an extreme point of C takes only the values e~* and 1. Thereby it 
suffices to show that for < w < 1 we have 

{1-U + ue*){l -u + tie"*) < a{t) . 

But the left hand side is invariant by changing u into 1 — tt, so that the maximum 
is obtained at w = 1/2, and is a{t). □ 

The proof of Proposition 2.1.1 goes by induction over N. The case N = 1 follows 
from the application of (2.1.4) to ^ = 1^. 

Suppose now that the result has been proved for N, and let us prove it for A/" + 1. 
Consider A C n^+'^ = x n. For uj e n, we set 

(2.1.5) A{uj) ^ {x en^ ; {x,uj) e A}. 

and 

B = {x e ; 3io e n , {x,io) e A} . 



With obvious notations, we have 

f{A, {x,u;))<f{A{u),x). 

Indeed, if |/ G A{lo), then {y,co) e A, and the number of coordinates where {y,u) 
and (xjUj) differ is the number of coordinates where x and y differ. Thus, by 
induction hypothesis, we have 



(2.1.6) / exp{tf{A,{x,uj)))dP{x) < 



ajt)^ 
P{A{u;)) 



We also observe that 

f{A,{x,u))<f{B,x) + l 
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SO that, by induction hypothesis, we have 

Jn- ^ P{B) ' 

and combining with (2.1.6) we get 

Integrating in we have 

e'fiM^^-^))dP{x)dii{uj) < a{tf [ min ( , 1 c//x(a;) . 



To complete the induction, it suffices to show, by Fubini theorem, that 

But this follows from (2.1.4) applied to the function g{aj) = P{A{uj))/ P{B). 
We now finish the proof of Proposition 2.1.1. We note that 

j-2n 

a{t) = l + Y: 



- 2(2n)! 

Now 2(2n)! > 4"'n!. Indeed, this holds forn = 1, n = 2, while if n + 1 > 4, we have 

(2n)! 



Thus 



= (n + 1) ■ ■ ■ (2n) > 4^ . 



lit) <1 + Y1 ^^''/^"n! = exp(tV4) . 



n>l 



Finally, (2.1.3) follows from Chebyshev inequality 



P{{f{A,-) > k}) < e-'^ j e'f^'^'^UPix) 



- P{Af 

for t = 2k/N. □ 

Remark 2.1.3. Consider a sequence {ai)i<N of positive numbers. If we now replace 
(2.1.1) by 

(2.1.7) f{A, x) = inf{^{ai: i < N ; Xi ^ yj: y e A} 
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the proof of Proposition 2.1.1. shows that 

(2.1.8) / e*^(^'^)dP(a;) < -^e' ^^<^ ""^^^ 
and, by Chebyshev inequahty 

(2.1.9) P{{f{A, .) > u}) < p^f~^^^ ^^^^ ■ 

A number of inequahties presented in Chapters 2 to 5 allow extensions that 
parallel the way Remark 2.1.3 expands Proposition 2.1.1. These extensions are 
immediate, and will not be stated. It should be pointed out, on the other hand, 
that no gain of generality would be obtained in Proposition 2.1.1. by replacing the 
product , by a product H ^ jJ^i- This comment also applies to many 

i<N i<N 

inequalities that we will subsequently prove. 
2.2. Sharpening 

Having proved (2.1.2), it is natural to wonder whether this could be improved by 
allowing another type of dependence of the right-hand side as a function of P{A). 
The most obvious choice is to replace P{A)~^ by P{A)~'^ for some a > 0. 

Proposition 2.2.1. For t >0, we have 
(2.2.1) / e'«^."dP(x) < 

where 

a" (e* - e-*/")i+" 



(2.2.2) a{a,t) 



(a+l)«+i (1 - e-*/«)(e* - 1)^ 



Proof. Following the scheme of proof of Proposition 2.1.1, (2.2.1) holds provided, 
for each function < < 1 on Q, we have 

min ( e* , — J d/x ( /" gd^] < a{a,t) . 



Following the proof of Lemma 2.1.2, we see that we can take 
(2.2.3) a{a,t)= sup (1 + w(e* - 1))(1 - w(l - e"*/"))'* , 

0<M<1 

from which (2.2.2) follows by calculus. □ 

Certainly neither the author nor the reader are enthusiastic about the prospect 
of using (2.2.1) and optimizing in Chebyshev inequality. The purpose of the next 
result is to obtain a more manageable bound, that also makes clearer the gain 
obtained by taking large values of a. 



CONCENTRATION OF MEASURE AND ISOPERIMETRIC INEQUALITIES 15 



Lemma 2.2.2. 



( 1 

a(Q:, t) < exp — I 1 H 

8 V a 



Proof. Interestingly, rather than using (2.2.2), it seems simpler to go back to (2.2.3) 
and to show that, whenever < w < 1, we have 

+2 / 1 

(1 + «(e* - !))(! - u{l - e"*/"))" < exp - 1 + - 

8 \ a 

or, equivalently 

(2.2.4) log(l + u{e' - 1)) + alog(l - u{l - e"*/")) <^^(l + -) . 

8 y (X J 



Since (2.2.4) holds for t = 0, it suffices to show that the derivative of the left-hand 
side is bounded by the derivative of the right-hand side for t > 0, i.e.. 



t > 
or, equivalently 
(2.2.5) t > 



ue 



ue 



-t/a 



l + u{e*-l) l-'u(l-e-*/«) - 4 CK ,' ' 



u-1 



u-1 



l + u(e*-l) l-'u(l-e-*/") - 4 ("^^ a ' ■ 



Again (2.2.5) holds for t = 0. So it suffices to show that for t > 0, the derivative 
of the left-hand side of (2.2.5) is bounded by the derivative of the right-hand side; 
or, equivalently, that 



u{l — u) 



1 

+ - 



J— t/a 



(1 — u + ue^y a {1 — u + ue~*/°')'^ 
Now, using the inequality Aab < (a + b)'^, we see that 



1 1 
4 4a 



u{l — u)e^ ^ 1 



u{l - w)e-*/" ^ 1 



(1 - w + 'ue*)2 - 4 ' (1 -'u + 'ue-V«)2 - 4 ■ 



□ 



Corollary 2.2.3. For t > 0, we have 

(2.2.6) J e'f(^'^UP{x) < ^ exp iV^ + . 



In particular, for k > ^ ^ log ppj; we have 



(2.2.7) P{{f{A, .) > /c}) < exp I -| I - J ^ log -^1 ^ 
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Proof. Certainly (2.2.6) follows from (2.2.1) and Lemma 2.2.2. Optimization over 
t in Chebyshev inequality yields 



It is an interesting fact that (2.2.7) is exactly the best bound that has been 
proved on P{{f{A, ■) > k}) using martingales (see [McD]). It is a natural question 
to wonder whether, when P{A) > 1/2, one indeed has 



for some universal constant K. More or less standard arguments (e.g., those con- 
tained in [T2]) show that it suffices to consider the case where O = {0, 1}, where P 
is the product of measures {iJii)i<N on fi, and where A is even "hereditary". The 
case where /ii({l}) = 1/2 for each i < N is known, as a consequence of more precise 
results, such as Harper's inequality. Intuitively, this is the worst case. 

Having obtained (2.2.6), one must wonder whether further improvements upon 
(2.2.6) are possible by considering yet other general dependencies of the right-hand 
side as a function of P{A). The reader who wishes to truly penetrate this paper 
will convince himself that this is not the case. 

2.3. Two point space 

Let us now consider the case where Q, = {0,1}, and set p = //({I}), so that 






yields (2.2.7). 



□ 




M{0}) = l-p. 



Proposition 2.3.1. For t >0, a > 1, we have 



(2.3.1) 




where, for p > 1/2, we have set 



(2.3.2) 



b{a,t,p) = {{1 - p)e +p){p+{l-p)e 



t/a\a 



and, forp < 1/2 



(2.3.3) b{a,t,p) = b{a,t, 1 - p) = {{1 - p) 



e"*+p)(p+(l-p)e*/") 
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Proof. Following the proofs of Propositions 2.1.1. and 2.2.1 it suffices to show that 
for any function < < 1 on we have 



/ 



As in the proof of Lemma 2.1.2., we reduce to the case where g > e~*/". Setting 
a — g{0)-, b — S'(l), it suffices to show that, for e~*/" < a, 6 < 1 we have 

(i^-p)^ + {{l-p)a+pbr < b{a,t,p) . 

Setting X — b/a, it suffices to show that 

e-*/'* <x< e*/'' ifi{x) < b{a,t,p) 

where we have set 

(p{x) = ((1 -p)x°' +p) y +p 



Now, 

^'(x) = ap{l -p) [x--^ - 1) ' 

so that decreases for a; < 1, increases for a; > 1. 
Also, we have 



v'{x) - ^ = ap{i -p) [1 - {{1-p) +pxr-^ - ((1 -p)x+pr-^ 



), 



so that, for a; > 1, this has the sign of 2p — 1. Thus for p < 1/2, (p attains its 
maximum on the interval [e~^/",e*/"] at the right end of this interval, while for 
p > 1/2 it attains its maximum at the left end. (One should observe that changing 
a; in 1/a; and p in 1 — p leave (p invariant.) □ 



A particularly important example is when 

A = i a; = (a;,) e {0, 1}^ ; ^ x^ < k 



i<N 



The use of (2.3.1) for this set and of Chcbyshcv inequality will in particular produce 
bounds for the tails of the binomial law. Thereby, it is not surprising that the 
computations involved in the use of (2.3.1) do run into the same type of difficulties 
as those involving the tails of the binomial law. We now show how, nonetheless, 
some simple and reasonably sharp results can be deduced (for general sets A) from 
(2.3.1). The reader will observe that the bound (2.3.1) is (of course) invariant when 
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p is replaced by 1 —p, so that there is no loss of generality to assume p > 1/2. Let 
us fix p, Q! > 1, and consider 

fit) = log5(a,t,p) = log((l -p)e* +p) +alog(p+ (1 -p)e"*/°) . 

Thus /(O) = 0, and 

fit) = il-p) - peV« + (l-p)) ■ 

Thus /'(O) = 0, and 

f"{t)^pil-p)ihie-') + -hie'/^)) 

a 

where 

hix) ^ ^ 



(pa; + 1 — (1 — (1 — a;)p)2 

Simple computations show that when x > 1/e, we have \hix) — 1| < K\x — 1| for 
some universal constant K. It follows that 

t<l^ f"it)<pil-p) + +4i^^) 

and, by integration, that 

t<l^ fit) <Pi^-P)[[^ + l)^^ + Kt'^ ■ 
Thus, we have shown the first half of the following. 
Corollary 2.3.2. For a > 1, < t < 1, we have 



(2.3.4) Je'f^^'-^dPix)<^^expN p{l - p) (^1 + + Kt^ 



In particular, for 



I \ 1/2 



4p(l-p)iVlog-p^ ) <k<pil-p)N 



we have 

i2.?,B\{fiA,x)>k}) 



2 



< exp I - ( fc - W 2j?(l - p)N log — ^ I + 



2pii-p)N\ y ^'(^) / (p(i-p))^A^M ■ 

To obtain (2.3.5), one proceeds as in the proof of (2.2.7), using first Chebyshev 
inequality for t = ^^^^ (i+a)N ^ ^^^^ taking 

a= -1 + 



2p(l-p)Ariog^ 

It is of interest to compare the bound (2.3.5) with the isoperimetric inequalities 
obtained in [Lea]; these isoperimetric inequalities are optimal, but apply only to 
special sets (the so called hereditary sets). The bound (2.3.5) is more general, and 
provides estimates of essentially the same quality. 

We now turn to a rather different situation. Beside the measure we consider 
on Q another probability fXi, with pi = /ii({l}) > p, and we set Pi = fx^ . 
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Theorem 2.3.4. For a subset A of Vt^ , and x e , we consider 
f{A, x) = mm{card{i < N ; Xi = 1 , yi = 0} ; y & A} . 

Then, for t > 0, 

(2.3.6) / e*/(^-)dP(x) < 

where 

a{a, t) = max(l, {l-p + pe*)(pie-*/" + 1 - pi)"") . 

Comment. The reaUy new phenomenon here is that for small t, one has a{a, t) = 1. 
In particular, if a = 1, this occurs whenever e* < pi{l — p)/p{l — pi) so that one 
has 

The remarkable feature about this statement is that it is independent of (and 
so is in essence an infinite dimensional statement). This is the first of the results 
we present that apparently cannot be obtained via martingales (so it deserves to 
be called a theorem rather than a proposition). The reader that would like to gain 
intuition about the phenomenon captured by Theorem 2.3.4 should consider the 
case where A = {x e {0, l}'^ , ^ order to have Pi{A) of order 1/2, 

i<N 

one takes n equal to Npi, assuming for simplicity that this number is an integer. 
Observing that 

f{A, x) > k ^ ^Xi > n + k ^ Npi + k ^ Np + {k + N{pi - p)) 

i<N 

the quantity P{{f{A, x) > k}) can be estimated through the tails of the binomial 
law; the most interesting values of are such that N{pi — p) k. 

The induction scheme of Proposition 2.1.1. will reduce Theorem 2.3.4. to an 
elementary two-point inequality, that is the object of the next lemma. 

Lemma 2.3.5. If a < b < 1, we have 
(2-3.8) ^+Pnim -,- < ^ > 



Proof. If we set x = min (^ , e*/") , we are reduced to show that 

l<x< e*/'' ^ i^{x) < a{a,t) 

where 

(p{x) = (1 -p + px"^) + {l-pi)Y < a{a,t) 
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But (p'{x) has the sign of p{l — pi)x°'~^^ — Pi{^ ~ p)i so it is negative for values of 
X close to one, and then, possibly, becomes positive. Thus ip attains its maximum 
on the interval [1, e*/'*] at one of the endpoints. □ 

Proof of Theorem 2.3.4- We proceed by induction over A^. For N = 1, since 
/(A, Lo) = when 1 e A, it suffices to consider the case A = {0}, in which case the 
result follows from (2.3.8) with a = 0, 6 = 1. 

Assuming now that the theorem has been proved for N, we prove it for A/" + 1. 
Consider A C fl^'^^, and set Ai = {x E fl^ ; {x, 1) e A}. Consider the projection 
S of ^ on Q^. We observe that 

/(A ix,u))<l + fiB,x) 

/(A {x,oj))<f{Ai,x) 

so that setting a = Pi(^i), b = Pi{B) and using the induction hypothesis, the 
result follows from (2.3.8). □ 

2.4. Penalties, I. 

A (somewhat imprecise) way to reformulate (2.1.1) is that we measure how far x 
is from A by simply counting the smallest number of coordinates of x that cannot be 
captured by a point of A. Rather than just giving a penalty of 1 for each coordinate 
we miss, it is natural to consider, given a non-negative function /i on f2 x the 
quantity 

(2.4.1) fh{A,x) = mi{ J2h{xi,yi)l 

i<N 

To simplify the notations, we will assume 

(2.4.2) yxen, h(x,x) = 
so that (2.4.1) becomes 

(2.4.3) fhiA, x) = inf I ^ /i(a;„ y^) ; y e A 

\^i<N 

Concerning (2.4.2), we should point out that we will let x, y denote points in 
as well as points in Q; when there is too much danger of confusion, however, 
points of Q will be denoted by a;, a;'. 

The function h will always be assumed to be measurable. The following simple 
result is already useful, as will be demonstrated in Chapter 11. 

Theorem 2.4.1. For each measurable subset A of Vt^ , and each t > for which 
J J expth{x,y)diJ,{x)diJ,{y) < oo, we have, setting v{u>,u>') — 'max{h{u>,u>'), h{u>' 
that 

{2.A.j) e*^'^(^'^)dP(a;) < (e*^('^''^')+e-*^('^''^'))d//(a;)d//(^^^^ . 



The crucial point of Theorem 2.4.1 is as follows. 
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Proposition 2.4.2. Consider a function g > on fl, and set 

(2.4.5) g{x) = mUg{y) + th{x,y)). 

yen 

Then 

(2.4.6) J e^d/i J e-^di^<^ J (e*"^'"''"') + e-*"^'"''"'^)c//x(a;)d/x(a;') • 

Indeed, a simple truncation argument shows that Proposition 2.4.2 remains true 
if one allows (using obvious conventions) g to take values in M+ U {oo}. To prove 
Theorem 2.4.1. by induction over N, considering a subset A of Q^'^^, for a; e f2 
we set 

A{u) = {x G O^; (x,w) G A}, 

and we define g by P{A{uj)) = e~^^'^K It should then be clear that (2.4.6) is exactly 
what is needed to make the induction work. 

Proof of Proposition 2.4-2. For simplicity we assume g measurable. Then the left- 
hand side of (2.4.6) coincides with 



f I ^i^)-9iy)dii{x)diJi{y) . 



We set u{x,y) — g{x) — g{y). By definition of g, we have g{x) < g{y) + th{x,y). 
Since h{x, x) = 0, we also have g{x) < g{x). Hence 

(2.4.7) u{x,y) <th{x,y) ; u{x,y) < g{x) - g{y) 
We now observe that for two numbers a, 6, if a + 6 < 0, then 

e'* _)_ e'' < gmax(a,6,0) _|_ g- max(a,6,0) _ 

Thereby, by (2.4.7), we have 

^u{x,y) _|_ ^u{y,x) ^ ^tv{x,y) _|_ ^-tv{x,y) ^ 

The result follows by integration.D 

It is of interest to get simpler bounds. Let us observe the following elementary 
fact (that is obvious on power series expansions) 

(2.4.8) The function a;-2(e^ + e-^- 2) increases for a; > . 
Thus, for t < 1 

^tv ^ g-tt, _ 2 ^ + e"" - 2 

and hence 

(2.4.9) < 1 + - (e^ + - 2) . 

We note that, for an increasing function 99, 

(/7(max(a, b)) < max((/?(a), v'(6)) < ^{a) + f{b) . 

Using this for (fi{x) = + — 2, a = h{uj, uj'), b = h{(jj', a;), using then (2.4.9) 
and integrating, we get the following from (2.4.4). 
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Theorem 2.4.3. // 

(2.4.10) JJ ex.ph{x,y)dn{x)dn{y) < oo , 
we have for t < 1, 

(2.4.11) / e'f^^^'^'^dPix) 

< exp (^Nf j j {e^^^''^') + e-'*^'^''^') - 2)dii{uj)dn{uj') 

Corollary 2.4.4. Assume 

(2.4.12) / / exp h{x, y)d^{x)dii{y) < 2 



Then for all u < 2N we have 

(2.4.13) P{{fH{A, .) > u}) < p^e-"^/^^ 



Proof. Since e < 1, under (2.4.12), the right-hand side of (2.4.11) becomes 
bounded by P{A)~^ exp Nt^, from which (2.4.13) follows by Chebyshev inequality.D 

The following resembles Bernstein's inequality. 

Corollary 2.4.5. Assume that ||/i||oo = sup h{x,y) is finite. Then 

(2.4.14) PiifkiA,-) > u}) < -^exp f-min ( 



p{A) ^ V \mh\\i ' 2 

where we have set \\h\\2 — (//q2 h^{oJ,uj')diJi{uj)diJi{uj')Yf'^ . 

Proof. By homogeneity, we can replace h by h' = h/\\h\\oo- For x < 1, by (2.4.8), 
we have -|- — 2 < x^(e + — 2) < 2x^. Thereby the right hand side of 
(2.4.11) becomes bounded by ^(A)"^ exp2A^t^||/i||2, from which the result follows 
by Chebyshev inequality. 

Remark. The reader has possibly observed that we have made no special efforts to 
get sharp numerical constants (in contrast with the previous sections) and we have 
used the simplest estimates, however crude. This feature will occur repeatedly. For 
a number of the results we will present, the proofs do not seem adapted to the 
obtention of sharp constants. Thereby, there is actually no point to track explicit 
values of the numerical constants involved. Throughout the paper, K will denote 
a universal constant, that may vary at each occurrence. 

2.5. Penalties, II. 
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It should be apparent from (2.4.1) that fh depends on h only through the prop- 
erties of the following functional, defined for subsets B oi Q 

(2.5.1) h{uj, B) = ini{h{u, u') • uj' e B} . 

(The reader should carefully compare this definition with (2.4.3) and note that in 
both cases the infimum is taken over the second variable.) 

Thereby, one should expect that the exponential integrability of h can be replaced 
in Theorem 2.4.1. By a weaker condition on the functional h{x, B). This is indeed 
the case. 

Theorem 2.5.1. Assume that for each subset B of VL we have 

(2.5.2) / exp 2h{x, B)dii{x) < — ^ . 

Then, for each subset A of fl^ , and each < t < 1, we have 

(2.5.3) / e'f^^'^'''^dP{x) < 



P{A)- 



Discussion. 1) It is good to observe and keep in mind that by Holder's inequality, 
we have for a < 1 



(2.5.4) J e"^d/x < ( j e^dnY 

Thus, the precise value of constants such as the constants 2, e that occur in (2.5.2) 
is completely irrelevant. Actually we will use the following consequence of (2.5.2): 



(2.5.5) / e^^h{x,B)dix{x) < 

Jq 



2) It is very instructive to compare (2.5.2) with a condition such as (2.4.10). 
Indeed, under (2.4.10), we have for all x 

(2.5.6) iJ,{B) exph{x, B) = jJi{B) inf exp /j,(a;, j/) < / exph{x,y)diJ,{y) . 

Integrating in x gives 

exp h{x, B)dfi{x) < ——— [f exp h{x,y)diJ,{x)diJ,{y) . 



KB) 



n2 



Thus (with the exception of the largely irrelevant factor 2), (2.4.10) appear stronger 
than (2.5.2). It is indeed much stronger, a fact that is not surprising in view of 
the crudeness of (2.5.6). To get a concrete example, consider the case where O 
is itself a product of m spaces (and /x a product measure), and denote by f{x,y) 
the Hamming distance in fl. Then Proposition 2.1.1. asserts that the function 
h = m~^l'^f satisfies (2.5.2). On the other hand (except in trivial cases) the 
function //a will fail (2.4.12) unless a is of order m. 

To prove Theorem 2.5.1, the induction method reduces to the proof of the fol- 
lowing. 
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Proposition 2.5.2. Consider < t < 1, and a function g > on fl. For s > 0, 
we set Bg = {g < s}, and we consider 

(2.5.7) g(x) = inf s + th(x, Bg) . 

8>0 



Then under (2.5.2) we have 

(2.5.8) j e^dfi J 



2 



Proof. We observe that 

(2.5.9) g{x)-g{y)<th{x,Bg^y)). 

We then follow the argument of Proposition 2.4.2, using (2.5.9) rather than the 
first part of (2.4.7). Combining with the argument of Theorem 2.4.3, we are led to 
show that 

e^^'''^o(y)^dnix)dniy) <4. 
Using (2.5.5) and Fubini theorem, it suffices to show that 

(2.5.10) / , .\ M y)<2. 

The best way to prove this inequality is to observe that the left-hand side depends 
only on the function s — > iJ,{Bs). Thus there is no loss of generality to assume that 
Q = [0,1], that fx is Lebesgue measure, and that g is nondecreasing. But then 
l^iBg^y)) > y, and y'^/^dy = 2. □ 

As pointed out in the discussion, a natural application of Theorem 2.5.1. is 
to the case where O is already a product space. This will be used implicitly, but 
crucially in Section 11.5. To formulate in words what happens. Proposition 2.1.1. 
states that if A is a subset of a product of N spaces, of measure 1/2, all but 
exceptional points x of 0,^ are such that there is a point in A that captures all 
but about VN of their coordinates. Suppose now that N = N1N2, and we think 
of the N coordinates as Ni blocks of N2 coordinates. Then, using Theorem 2.5.1, 
we know that (for but exceptional points x) not only we will find a point in A that 
misses only about coordinates of x, but these coordinates will be concentrated 
in only about ^/Nl blocks. An interesting question would be to quantify precisely 
what can be said when, rather than considering only two "levels" , one considers a 
large number of levels. 

2.6. Penalties, III. 

In this section, we explore a new phenomenon, that will also be met in Sections 
3.3.3 and 4.4.4. The notations of the present section will be used throughout the 
paper. Roughly speaking, what happens is that if, in (2.5.3), one allows a more 
general type of dependence on P{A) of the right hand side, then a weaker condition 
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than (2.5.2) will suffice; this will mean in practice weaker integrability requirements 
on h. 

The dependence in P{A) we will consider will be of the type e^*^^^^-*-*. Through- 
out the paper, 9 will denote a convex decreasing function from ]0, 1] to M""", such 
that ^(1) = 0, lim d{x) = oo. The most important example is d{x) = — logo;, in 

a;— >0 

which case e^^^^"^^^ is the familiar quantity 1/P{A). We will always denote by ^ the 
inverse function of 6, so that ^ is a convex function from W'^ to ]0, 1], with ^(0) = 1. 
We will always assume the following 

(2.6.1) decreases ; V6 > , ^"(b) < \^'ib)\ 

For X G M, we set a;"*" = max(a;,0), and we will keep the following notation, for 

X eR.be 



(2.6.2) E{x, b) = ax+) - m - {x+ - b)^\b) 



We denote by A Lebesgue measure on [0, 1]. The measure of a Borel set B is 
simply denoted by \B\. 

Central to this section is the following technical condition, that relates ^ and a 
function w >0 defined on [0, 1]. 

Condition H{^,w). 

(2.6.3) V6 > 0, Vt, < t < 1, / E(b-tw{u),b)dX{u) <t'^\^'(b)\. 

Jo 

First we will investigate conditions that imply (2.6.3) under two simple choices of 
^. Then we will look at a rather general situation where the meaning of (2.6.3) 
can be considerably clarified; and before stating the main result (Theorem 2.6.5) 
we will prove a technical lemma that will explain the precise purpose of condition 
H{^,w). 

Proposition 2.6.1. When^{x) — , condition H{^,w) holds provided J e^dX < 
2. 



Proof. Indeed, we have 

E{x, b) = e-^^ - e-^ + {x+ - b)e-^ 
< e-^ - e-^ + ix- b)e-^ 
= e-^(e-^^-'') + (x-6)-l) 

Thus (2.6.3) holds provided 



t<l^ J (e'"' -tw- l)dX < t^ . 

But, since the function x~^{e^ —x — 1) increases for x > 0, we have (e*"' —tw — 1) < 
t^ie'^-w-l). □ 
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Proposition 2.6.2. If^{x) = then condition H{^^u3) holds provided J w^dX < 
1. 

Proof. Setting y = x'^ , we have 

"^"^''^ y+1 6+1 + (6+1)2 

- <{y-br\e{b)\. □ 



(y+l)(6+l)^ 

One obvious consequence of (2.6.3), taking t = 1 is that 

(2.6.4) \{w>b}\E{0,b)<\e{b)\. 

In practice for b large 5(0, b) is of order 1; so (2.6.4) is really a tail condition. The 
next result shows that a condition of a similar nature is indeed sufficient, provided 
^' varies smoothly (i.e., satisfies the A2- condition; which is not the case when 

Proposition 2.6.3. Assume that for a certain number L > 0, we have 

(2.6.5) V&>0, Vt<l, |^'(A)| <Lt2|^'(6)|. 
Then (2.6.3) holds provided the following two conditions hold: 



(2.6.6) J w'^dX < J 



(2.6.7) V6>0, \{w>b}\<^\eib)\ 



2L' 



Proof. We write 



(2.6.8) [E{b-tw,b)dX< [ E{b - tw,b)dX+ \{tw > b/2}\ . 

J J{tw<b/2} 

By Taylor's formula, since ^" decreases, and ^"{b) < \^'{b)\, we have, by (2.6.5) 

x>b/2^Eix,b)<^-^^e Q 



2 



Thus 



/ E{b-tw,b)dX<^\^'{b)\[w^dX. 

J{tw<b/2} 2 J 
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Also, by (2.6.7), (2.6.5) 



\{tw>b/2}\<^ 



a- 



The result follows, combining with (2.6.8) . □ 

The reader should observe that the functions ^{x) — (1 +x)~" {a > 1) satisfy 
(2.6.5). 

The following lemma explains the purpose of condition H{^,w). 

Lemma 2.6.4. Consider a function f > on Q. Assume that for a certain t, 
< t < 1 and all s < b we have 

(2.6.9) //({/ < s}) < \{tw >b-s}\ = \{b-tw <s}\. 

Then under condition H{$^,w), for each set C we have 

(2.6./o)e(/)d// 
Jc 

<mm+eib) [ if-b)dfi+t^\e{b)\ + le{b) [ if-brdfi. 

Jc ^ Jcn{f>b} 

Proof. By definition of S, (2.6.10) is equivalent to 

/ Eif,b)d^^<t'\^'ib)\ + k"ib) I if-b)'dfi. 
Jc ^ Jcn{f>b} 

By Taylor's formula, and since ^" decreases, for x > 6 we have 

1 



and thus 



Eix,b)<^-ix-bn"ib) 



I E{f,b)d^<le{b) [ u-bfdij. 

Jcr\{f>b} ^ Jcn{f>b} 



If we remember that S > 0, and if we use condition H{^, w), we then see that it 
suffices to show that 

(2.6.11) / E{f,b)dii< E{b-tw,b)dX. 

J{f<b} J 

Now, (2.6.9) implies that for all s < 6 we have 

K{f<s})<\{b-tw<s}\. 
Thus, since E{x, b) decreases for x < b, we have, for all ^ > 

//(1{^<6}S(/, b)>z)< \{E{b - tw, b)>z}\, 
from which (2.6.11) follows. □ 
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Theorem 2.6.5. Consider a function h on VL x Vt, and a nonincreasing function 
w on [0, 1] such that J w^dX < 1. Assume that for each subset B of Vt, we have 

(2.6.12) / exp/i(x,S)c///(a;) < exp(«;(//(S))) 

JQ. 

where we keep the usual notation h{x, B) = mf{h{x, y) ; y E B}. Consider a func- 
tion 6 as usual, and assume that the condition H{(, w) holds. 

Then, for each subset A of , and all t < 1, we have, for allt<l 
f e^f^^^'^'UPix) < exp(4Art2 + ^(P(yl))) . 

To understand better (2.6.12) it is of interest to specialize to the case where 
h depends only on x (resp. y). If h depends on x only, (2.6.12) means that 
J^exph{x)diJ,{x) < expt(;(0). If h depends on y only, then (2.6.12) becomes 

M{h{y) ■,yeB}< w{f,{B)) . 

Taking B = {h> s}, we get s < w{n{{h < s})) and, since w in nonincreasing, this 
implies 

l^{{h > s}) < \{w > s}\ . 

It is easy to see that, conversely, this implies (2.6.12) when h depends upon y only 
and when w is left continuous. 

To prove Theorem 2.6.5, it suffices, using induction over AT, to prove the follow- 
ing. 

Proposition 2.6.6. Consider a function g on Q, < g < 1, and set 

eg{x)^ inae{g{y))+th{x,y)}. 
yen 

Then, under the conditions of Theorem 2.6.5, for t < 1, we have 

J e^^dfx < exp(4t2 + 9{J gdfj,)) . 

Clearly, this is equivalent to the following. 
Proposition 2.6.7. Consider a function f on Q, f > 0, and set 

f{x)=ml{f{y) + thix,y)}. 

Then, under the conditions of Theorem 2.6.5, for t < 1, we have 

(2.6.13) J e^dn< exp{5t^ + 9{J C{f)dl^)) ■ 
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Proof. The problem is that we have on the right of (2.6.13) the quantity 9{j ^ {f)diJ,) 
rather than the larger quantity / fdfx. We consider t as fixed through the proof. 

Step 1. We set ^ {f < s} for s > 0, and 

b = mi{s + tw{fM{Bs))}. 

s 

We note that f{x) < f{x). We consider the function /' given by 

fix) = fix) if fix)>b 

fix) = b if fix) < b < fix) 
fix) ^ fix) if fix)<b. 

Since / < /, we have f < f < f. Thus / efd/i < J e^' d/i, and < ^{f), so 
imdfi < jaf')df^ and OiJ af)dfi) < eij^if)dfi). Thereby, it suffices to prove 
that 

(2.6.14) /e^'.,<exp(5*^+«(/«/')*)). 

Step 2. By definition of b, for s < b, we have 

twiniBs)) >b-s. 
Since w is nonincr easing, we have 

(2.6.15) \{tw>b-s}\>i^iBs). 

Since fix) = fix) when fix) < 6, we see that (2.6.9) holds (for /' rather than 
/). Since f — f when fix) > 6, by (2.6.10) used for C = f2, we get, since 

e'{b) < 

i2.6.lf)af)d^^<m + e{b) J{f-b)d^^+Wib)\it'+l J ^ Jf - b)'d^^) . 



Step 3. If J/ e -Bs, we have 

fix) < fiy) + thix, y)<s + thix, y) 

so that 

fix) <s + thix, Bs) . 

Thus, by (2.6.12) we have 

J e^~'^ i/i < exp(t-^(s + twijiiBs)))) . 
Taking the infimum over s yields 
(2.6.17) J e'~'^^-''Uii<l. 
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Since e^^ < 1 + e^, we get 

(2.6.18) j e^~'(f-^^^diJi<2. 

Step 4. The inequality > 1 + for x > 0, and (2.6.18) show that 

/ ((/ - b)+fdfx < 2t^. Combining with (2.6.16), we get 

(2.6.19) j af)dl^ < m + J (/' - b)dfi + 2t^\e{b)\ . 

The convexity of 6 imphes that 9{x) > 6{y) + {x—y)9'{y). Also, since 9{^{x)) = 1, 
we have e'{^{b)) = 1/C'{b). Thus (2.6.19) implies 

(2.6.20) e{ J af')dii) >b + J{f- b)d^ - 2t^ 

= J rdii-2t\ 

Step 5. To finish the proof, it is thereby sufficient to show that 

(2.6.21) j e^'dii < exp(2t2 + J fd^) . 

Consider the function r(x) = — a; — 1, so that 

J e^'-^dti = 1 + j{f' - b)dn + j r{f' - b)diJi 
< exp( j if - b)dfi + J r{f - b)dfi) 

and thus it suffices to show that J r{f' — b)dji < 2t^. We observe by (2.6.18) that 

/ T{t-\f' -b))d^l<l 

J{f'>b} 

and, since as already observed, the function x~'^r{x) increases for a; > 0, this implies 

/ r{f - b)dfi < e . 

J{f'>b} 

Also, it is elementary to see that r{x) < x'^/2 for a; < 0. Now, by (2.6.15), we have 

/ (/' - bfdfx <t^ I W^dfX <t^ . □ 
Jif'<b\ J 



2.7. Penalties, IV. 
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This section is devoted to remarkable fact that if (2.5.2) is suitably reinforced, 
the term ex.pt^N can be removed in (2.5.3). 

To express conveniently the conditions we need, we introduce the function c(a, t), 
defined for < a < 1, t > 0, as follows (c stands for concentration): if i/i is 
the measure on R of density |e~l^l with respect to Lebesgue measure, we have 
c(a, t) = 1^1 ((—oo, b+t]) , where b is given by a = ^'i((— oo, b]). Simple considerations 
show that 

I 

a > - =^ c(a, t) = 1- e~*(l - a) 
a < - , e a < - =^ c(o, t) = e a 
a < - , e a > - =j> c{a,t) = 1 . 

Theorem 2.7.1. Assume that for each subset B of Q we have 

(2.7.1) t<l^ l^{{h{; B) < t^}) > c{^{B),t) 

(2.7.2) t>l^ f^HH-, B) < t}) > c{^{B),t) . 

Then, for each subset A of , we have 

(2.7.3) J e^~' f'^^^'''UP{x) < -pi^ 

where fh is given by (2.4-3) and where K is universal. 

Our first task should be to give natural examples of situations where (2.7.1), 
(2.7.2) occur. 

Proposition 2.7.2. Consider the probability i>i onR, of density ^e~^^^ with respect 
to Lebesgue measure. Then the function h{x,y) — min(|a; — y\, \x — j/p) satisfies 

(2.7.1) , (2.7.2) (forPi, rather than ii) . 

Proof. For a subset C of M, and t > 0, let us set Ct = {x E R; d{x, C) < t}. To 
prove (2.7.1), (2.7.2), it suffices to show that 

z^i(Ct) >c(z/i(C),t). 

This is proved in [T4] using rearrangements. 

We sketch below a simpler alternative argument to prove the weaker result 

(2.7.4) yi{,Ct)>c{vi{C),t/2). 

(The reader should observe that this suffices to prove that h/A satisfies (2.7.1), 

(2.7.2) .) 

First, we reduce to the case where C is a finite union of intervals. Setting 

u{t) =inf{|a;|; a; e Q/CJ , 
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it should be clear that 

(2.7.5) ^(a)>iexp(-^t)). 

By definition of u{t), we see that the interval [—u{t), u{t)] is either contained in 
the closure of C^, or else it does not meet Ct- Thereby, we have either 

ui{Ct) > 1 - 2u^{[u{t),oo)) = 1 - e-"W 

or else 

i^i(Ci)<2zyi([zi(t),oo)) = e-'^W 

so that, in any case 

e-"W >min(^/i(Ct),l-z/i(a)). 

Combining with (2.7.5) shows that as long as i'i{Ct) < 1/2, we have ^(log i^i(Ct)) > 
1/2, so that z^i(Ct) > e*/^z^i(C). Similar considerations complete the proof. □ 

Other examples can be generated using Proposition 2.7.2 and the following simple 
observation. 

Proposition 2.7.3. Consider a probability space {Q,iJ,), a function h on Q^, that 
satisfies (2.7.1), (2.7.2). Consider a measurable map r/ from Vt to a measured space 
Q.' , and the measure fj,' = rj{ii) on Q' . Consider a function h' on Q.'^ such that 

(2.7.6) Va;,y e f], h' (n{x),r]{y)) < h{x,y) . 
Then h' , ji' satisfy (2.7.1), (2.7.2). 

Proof. This is obvious using the relations ii{r]~^{B)) = IJ,'{B), h{x,r]~^{B)) > 
h'{ri{x),B). 

The use of Propositions 2.7.2 and 2.7.3 will allow the construction of a wide class 
of examples. 

Proposition 2.7.4. Consider a convex symmetric function '0 > on M, with 
lim ip'{x) = oo, and the probability of density a^e~'^^^^ with respect to Lebesgue 

x—^oo 

measure, where is the normalizing constant. Then there is a constant K{'^) 
depending on only such that the function h{x, y) on given by 

(2.7.7) \x-y\<l^ h{x,y) = ^^\x-y\^ 

(2.7.8) \x-y\>l^ h{x, y) = " V 
satisfies (2.7.1), (2.7.2) with respect to v^. 

Proof of Proposition 2.7.4-- Consider the nondecreasing map r] from M to M that 
transports vi to v^. Thus 



/■oo /"OO -| 

(2.7.9) / av,e-^WcZA(t) = / -e'^'^dXit) 

J'n{x) Jx ^ 
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By Propositions 2.7.2, 2.7.3, it suffices to show that 

(2.7.10) h{rj{x),rj{y))<mm{\x-y\,\x-y\^). 

It is simple to see that (2.7.10) will follow from (2.7.7), (2.7.8) (with a suitable 
choice of the constant there) provided we can show that 

(2.7.11) \r]{x)-v{y)\<K{i;)\x-y\ 
(2-7.12) i; (^.^\rj{x) - rj{y)\^ < \x - y\ . 

There, as in the rest of this proof, K{iIj) denotes a constant depending on i/j only, 
that may vary at each occurrence. 

To prove (2.7.11), it suffices to prove that r]'{x) is bounded when x > 0. Dif- 
ferentiating (2.7.9), we get a-ipr]' {x)e~'^^'^^''^'> = e~^/2, and plugging back in (2.7.9), 
we get 

/•oo 

Thereby, it suffices to show that 

nOO 

sup / e-^(^^+^^"^dt<oo. 

u>0 Ju 

Given uq > 0, the supremum for u < Uq is certainly bounded. On the other hand, 
for u > Uq, by convexity of ip we have '0(t) — V'(f^) ^ (^ ~ {uq), so it suffices to 
choose Uq with iI)'{uq) > 0. 

We now turn to the proof of (2.7.12). It suffices to prove that, for y > x > 0, we 
have ip{{r]{y) — 'r]{x))/ K{il:)) < y — x. Setting a — il^~^{y — x), it suffices to show 
that r]{x + ip{a)) < r]{x) + K{'ip)a, i.e., that 

poo 1 
(2.7.13) / e-^Wrft < -g-^-'^^") . 

First, we note that, since '(/'(t) > il^iv) + (^ ~ y)'4^'i.y)i we have, for j/ > 0, that 

1 

so that 



(2.7.14) / e-^^'^dt < , , -i.{v{x)+2a) 

^ ' ^Jr,ix)+2a - V''(^(^)+2a) 



Also, 



1 r°° 

(2.7.15) -e-^ = / e-^^dt > a^ae-'^(^(^)+«) 

2 J-qix) 
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Since ip'{y) increases for y > 0, we have ip{r]{x)+2a) > ip{r]{x) + a) +'il^{a) . Thus, 
from (2.7.14), (2.7.15) we see that (2.7.13) holds provided i^('0) > 2, ai^'irjix) + 
2a) > a^. 

On the other hand, using again convexity, we see that 



oo 



< ^-aji' (v{x))-x ^ 



Thereby, if i^(V') > 1, (2.7.13) will hold provided a^'{'q{x)) > i/}{a), and in 
particular if 77(0;) > a. 

Thus we can assume r]{x) < a, ai(^'{r]{x) + 2a) < a^. This means that a and x 
stay bounded; but the conclusion is obvious then. □ 

It is of particular interest to consider the case where •0(a;) = x^, so that 
is Gaussian. In this case. Proposition 2.7.4 shows that one can take h{x,y) = 
K~^{x — y)"^- This recovers the concentration of measure for the Gauss space, as 
expressed by (1.7). There is, however, a big loss of information in (2.7.10); and the 
result obtained by taking 

h{x,y) = uvin{\rj-\x) - rj-\y)\, {rj-\x) - rj-\y))^) . 

is rather more precise than (1.7). 

The induction step of the proof of Theorem 2.7.1 reduces to the following. 

Proposition 2.7.5. There exists a universal constant L with the following prop- 
erty. Consider a function g on Q, and define 

(2.7.16) g{x) = inf ^(y) + ]rh{x, y) . 

Then, under (2.7.1), (2.7.2), we have 

(2.7.17) I e^dfi I e-^dfi < 1 . 

Jq Jq. 



Let us recall that we denote by vi the probability measure on M of density e^l^l/2 
with respect to Lebesgue measure. During the end of this section, for a; e M we set 

(/7(x) = min(|x|, x^). 

The proof of Proposition 2.7.5 is considerably simplified by the following obser- 
vation. 



Proposition 2.7.6. Consider a function g on Q,, and g given by (2.7.16). Then 
we can find two nonincreasing functions g\, g2 on M with the following properties 

(2.7.18) [ e^djji^ [ e^'dui; [ e'^dn^ [ e'^'dui 

Jq Jr Jq Jr 
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(2.7.19) VxeM, g2{x)<mlg^{y) + y<p{\x-y\). 

In particular, this implies that we have reduced to the case fj, = i/i, g nonincreas- 
ing, h{x,y) = ip{\x-y\). 

Proof. We define, for j/ e R 

(2.7.20) g,{y) = inf {t ; fii{g < t}) > z/i([y, oo))} 

(2.7.21) g2{x) = sup{u ; /x({^ > u}) > z^i((-oo, x])} . 

Thereby both gi, g2 arc nonincreasing; it should be obvious that (2.7.18) holds. 
We prove (2.7.19). Consider x <y. By (2.7.20), we have //(-B) > vi{[y, oo)), where 
B = {g< gi{y)}. By (2.7.1), (2.7.2), we have 

^i{{h{■,B)<if{t)})>c{^l{B),t) 

> c(j^i([y,oo)),t) 

= J^i([y-^,oo))- 

Since g < gi{y) + ip{t)/L on the set {/i(-, B) < (p{t)}, we get 

/i(^|?<^7i(2/) + ^|) >M[y-t,oo)). 

On the other hand, by (2.7.21) we have 

l^{{g > g2{x)}) > iyi{{-QG,x]). 

Thus, ift > y-x, wehave5r2(a^) < gi{y) + (p{t)/L. Thus (a;) < g{y) + (p{y - x)/ L, 
and (2.7.19) follows. □ 

We next show that we have reduced the proof of Proposition 2.7.5 to the follow- 
ing. 

Proposition 2.7.7. There exists a universal constant L with the following prop- 
erty. Consider a nonincreasing function f on M, with /(O) = 0. Define 

(2.7.22) /(^) = inf/(2;) + l<^(|x-y|). 

2/eK L 

Then, if f has a Lipschitz constant < 2/L we have 

dvi < 1 . 



We prove the claim stated before Proposition 2.7.7. In view of Proposition 2.7.6 
and (2.7.19), it suffices to prove that J e^^dvi J e~^^dvi < 1, where 'gi is given by 
the right-hand side of (2.7.19). Define now 



(2.7.23) 



f{y) = sup^i(a;) - -'p{\x - y\) 
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Since for all x and y we have gi{x) < gi{y) + j^ip{\x — y\), we see that f{y) < gi{y). 
Thus, / e-^diyi > ] e'^^dvi. Also, by (2.7.23), wehave^i(x) < f {y) + ^ip{\x - y\) 

for all so that gi < f. Thereby it suffices to prove that J e-^dvi J e^-^dvi < 1. 
The condition /(O) = is certainly not restrictive, and / has a lipschitz constant 
< 2/L by (2.7.23) since (p has a lipschitz constant < 2. 

Upon seeing the result of [T4] exposed in a seminar, B. Maurey produced a 
rather magic proof of Proposition 2.7.7. The proof we will give is more in the spirit 
of the arguments of the present paper, and is likely to be more instructive as it 
prepares for the considerably more delicate results to be presented in Chapter 4. 
We start by a simple lemma. 

Lemma 2.7.8. Consider a nonincreasing function u on R, such that 1/(0) = 0. 
Then 

I u'^dvi<K^{u{-k)-u{-k + l)fe-^. 

k>l 



Proof. For simplicity we set Uk = u{—k). Thus 

2-k+l 



k>l 

Since < 2w^_j^ + 2{uk — Uk-iY, we have 

5 < 2 5^ 4-ie-'+' + 2 ^(«fc - «,-i)^e-^+i 

fe>l k>l 

But since tto = 0, the first sum is exactly 25/6, so that 



s(l--\< 2e J](«fe - Uk-ife-^ . □ 

^ k>l 



During the proof of Proposition 2.7.7, we will consider another number 1 < 
M < L. The numbers M, L will be chosen later. The crucial part of the proof of 
Proposition 2.7.7 is as follows. 

Proposition 2.7.9. Consider a non-increasing function u on R, with 'u(O) = 0. 
Assume that \u\ < 1/M, and set u{x) = mfu{y)-\-(fi{\x — y\)/L. Then, if L > KM, 

V 

we have 

f M f 

(2.7.23) {u- u)dvi >-pT u'^dui . 

Jr K 

Moreover, if M > K, we have 

(2.7.24) [ e^dvi + / e'^'dvi < 2 - ^ /" u'^dvi . 
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Proof. To prove (2.7.23), it suffices to prove it when the right-hand side is replaced 
by /{„>o} u'^dii (resp. J^^^Qj v^dii). The arguments for these two cases are similar 
so we treat the first case only. We set Uk = u{—k), so that Uk < M~^, and 
M(uk - Uk-i) < 1. We set Nk = [2/M{uk - Uk-i)]. Thus we have Nk > 2 and 

(2-7.25) — ^ <Uk- Uk-i < ^ 



2MNk ~ MNk 

For /c > 1, £ > we set ak/ = —k + 1 — l/Nk, and Uk,(. = u{ak/)- Thus Uk,o — Uk-i, 
'^k,Nk — Uk- For /c > 1, 1 < I < Nk, we consider the subset Rk/ of M? given by 

4 

Rk,e =]ak,i+i, Ofc.^x] min('Ufc,^ Uk,e-i + yrTf )' • 

We observe that no point belongs to more than two intervals ]ak/+i, ak/[, for 
1 < i < Nk, A; > 1 so that the rectangles Rk/ have the same property. Since 

u{x) > Uk^i for X < ttk/, Rk,i is below the graph of u; but, since u{ak,e.-i) = Uk,£-i, 
we have u{x) < Uk,£-i + 4:/LN^ on [ak,i+i,ak,£]- Thus Rk,£ is above the graph of 
u, and hence 

[ {u-u)dPi >\y2i X] ^i{[ak,£+i-,a-kA) ('^k,£ - min (uk,£,Uk,£-i + )) • 

k>ll<£<N, ^ ^ ^^^k/J 

Since iyi{[ak,£+i, ak/]) > e~^/KNk, we have 



j{u-u)d.,>^Y.Wk ^ ("'^'^ - "^'^-^ " L^) 



k>l 



> J]e ''{uk-Uk-if 



k>l 



by (2.7.25), and provided L > 16M. Thus, (2.7.23) follows from Lemma 2.7.8. 
To prove (2.7.24), we use that < 1 + x + for \x\ < 1. Thus 

/ e"cJz^i + / e""(ij^i<2+ / {u - u)dui + / u^dpi+ i u^dvi. 
Jr Jr Jr Jr Jr 

Now, 

2 1 
< 2u^ + 2{u - uf < 2^2 + — (w - u) < 2u'^ + ^{u-u) 

provided M > 4, and thus 

/ e^dvi + / e""dz^i < 2 + 3 / v^dvi - - {u - u)dvi 
Jr Jr Jr 2 
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and the result follows from (2.7.23). □ 

Proof of Proposition 2.7.7. We observe that, for a e R, we have a (2 — a) < 1. Thus 
it suffices to show that 

(2.7.26) I e^dui+ e~fdui<2. 
We set u = min(l/M, max(/, -1/M)). Thus 

P P POO 

(2.7.27) / e-^dui < / e'^'dui + / {e'^ - e^'^)dv^ 
Jr Jr Jb 

where f{b) = —1/M. We observe that if u{x) < 1/M, then f{x) < u{x). Indeed 
if u{x) < 1/M, then given e with u{x) < e < 1/M, there exists y with u{y) + 
L~^(p{\x — y\) < e. Thus u{y) < 1/M, and thus f{y) < u{y), so that f{x) < e. 
Then, if c is the largest so that /(c) = 1/M, we have 

(2.7.28) / e^dvi< [ e^diyi+ [ {e^ - e^/^)di^i . 
Jr Jr J-oo 

Since /(O) = 0, we have c < < 6. Since / has a Lipschitz constant < 2/L, for 
X > b we have 



and thus 



/OO pOO -I 

e-^dvr < ^e'/^+^(-b)/^e--dx 



gl/M 

Y3^-i([^oo)) 



Hence we have 



ui{[b, oo]) 



K 

< — - — / u dvi 



since u{x) = —1/M for x > b. Using (2.7.27), (2.7.30), and making a similar 
computation for /^^(e-^ — e^/^)di'i yields 



f e^dui + [ e ^'dui < f e^dvi + [ e "di^i + f u^dvi . 

Jr Jr Jr Jr ^ Jr 

It then follows from (2.7.24) that (2.7.26) holds provided M >K, L> KM^. □ 

It would be of interest to understand exactly which are the functions (p such 
that, if one sets 

/(^) = ^^ifiy) + 'fii^ -y) 1 



CONCENTRATION OF MEASURE AND ISOPERIMETRIC INEQUALITIES 39 



then J e-^dui J dvi < 1. On the other hand, the situation is considerably clearer 
if one considers the standard Gaussian density 71 rather than v\ . In that case, the 
obvious adaptation of Maurey's argument shows that if a > 1, and if f{x) = 

inf Q;/(y) + 2{a+i) ~ y)'^i ^^^^^ / '^'^ '^7i(/ e~-^(i7i)" < 1. Thereby, by induction, 

and with the notations of (1.7), we get 

lN{At) > 1 

so that, by optimization over a, for t > ■\/2 log(l/7Ar(A)), we get 

IN (At) >l-exp-^{t- V21og(l/7Ar(A)))2 
which is not so far from (1.7). 



40 



M. TALAGRAND 



3. Control by q points 
3.1. Basic result 

Consider an integer q>2. For subsets Ai^. . .^Aq of and x e Q,^ ^ we set 

(3.1.1) /(Ai, ...Aq,x)= inf{fc ; 3|/i e , y2 e ^2, . . . , G A, ; 

CBxd.{i<N;xi^{yl...,yl}}<k}. 

Theorem 3.1.1. We have 



(3.1.2) j Q^(^i' -'^<"^)dP(a;) < -j^ 

In particular we have 



(3.1.3) P{{f{A,...A,x)>k})< ^ 



The induction method will reduce this statement to a simple fact about functions. 
Lemma 3.1.2. Consider a function g on such that 1/q < g <1. Then 



(3.1.4) / -dfi{ [ 

Jo. 9 JQ 



gdiiY < 1 . 



Proof. We could use the extreme point argument of Lemma 2.1.2. One alternative 
method is as follows. Observing that logo; < a; — 1, to prove that < 1 it suffices 
to show that a + qb < q + 1. Thus, it suffices to show that 

1 f 

-d/j, + q gd/j, < q + 1 . 



'n 9 Jn 

But this is obvious since x~^ + qx < q + 1 for q~^ < x < 1. □ 
Corollary 3.1.3. Consider functions gi on Vt, gi < 1. Then 



(3.1.5) / niin ( Q, — ) d^TT / gid/i 

Jn V 9iJ f^gJ 



< 1. 



Proof Set g — (min(g, ^)) ^, observe that gi < g, and use (3.1.4). 

i<q 

We now prove Theorem 3.1.1 by induction over N . For N = 1, the result follows 
from (3.1.5), taking gi = 1^.. 

We assume now that Theorem 3.1.1 has been proved for A^, and we prove it for 
N + 1. Consider sets Ai, . . .,Aq of For a; e f], we define the sets Ai{u) as 
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in (2.1.5) and we consider the projection Bi of Ai on . The basic observation is 
that 

(3.1.6) /(Ai, ...,A„ {x, a;)) < 1 + f{B,, ...B„ x) 

and that, if, j < q 

f{A^,...,Ag,{x,u))<f{Ci,...,C„x) 

where Ci = Bi for i ^ Cj = Aj{oj). 

If we set gi{(jj) = P{Ai{uj))/P{Bi) using Fubini theorem and induction hypoth- 
esis, we are reduced to show that 



/ mi 
Jq 



1 \ 1 
mm I g, mm — ■. — r 1 < 



i<q 



which is (3.1.5). □ 
3.2. Sharpening. 

Given ct > 1, we can now, in the spirit of Proposition 2.2.1, look for the largest 
number a — a{q, a) for which we can prove that 

(3.2.1) J* a{q, a)/(^--'^--)dP(x) < J^^^ ■ 

i<q 

Following the proof of Theorem 3.1.1, we see that we can take for a(g, a) the unique 
number x > 1 such that 

(3.2.2) x + qax-^/" = l + qa. 
It then follows from (3.2.1) that 

(3.2.3) P({/(A,...,A-)>fc})<inf^^^. 

There is no obvious way to compute the right-hand side of (3.2.3). However, 
for large q, we have the following, that improves upon (3.1.2) for large values of k 
{k > glogg). 

Proposition 3.2.1. There exists a universal constant go such that, if g > go, we 
have 



(3.2.4) PmA...,A,.)>k})< (j^^^) (^) 



(jlogq 



Proof. We take a = log g, and we show that for g large enough, we have 

o(?, a) > a :— 1 + (l j glogg. 
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For large g, we have a > g, so that a^/" > e, so that 

a - 1 < get f 1 ^ ) 

and thus a + qaa~^^'^ < 1 + ga. □ 

It is interesting to note that Proposition 3.2.1 is rather sharp. Consider the case 
where f2 = {0, 1}, and where gives weight p to 1 {p <l/2). Assume for simpUcity 
that r = pN is an integer. Consider the set A= {x E il^ ; ^ Xi < r}. Then P{A) 

i<N 

is of order 1/2. Considering s = rq + k, we clearly have that Yl — ^ implies 

i<N 

fiA,...,A,x)>k. Thus P{{f{A, ...,A,x)>k})>p%l-p)''-'Q. 
When s < N/2, we have (^) > (iV/2s)*, so that 

P({/(A...,A.}>«=})>(^)'e-">(^)' 

> ( ^ 

~ \2e{q + k/r) 

If we take k > glogg, fixed, and then r of order k/qlogq, we get a lower bound of 
order (1/i^glogg)'^. 

3.3. Penalties. 

The result of this section is the one single major theorem of Part I that has not 
been motivated by direct applications. Rather, it has been motivated by a desire 
of symmetry with Sections 2.7 and 4.4. 

We consider a "penalty function" h{uj,uj^, . . . ,a;^) on $7^+-'^. We assume h > 
and 

(3.3.1) u} e {cj\ . . . ,a;«} ^ h{uj,uj^, . . . ,a;«) = . 

For subsets Ai, . . . ,Aq of , we consider 

{3.3.2)fh{Au . . . , A„ x) = inf i 5^ hixi, yl . . . , ; e ^li, . . . , y« G A, 

\^i<N 

The case considered in Section 3.1 is where /i(a;,a;^, . . . ,a;^) = 1 unless uj e 
{u>^ , . . . , Lo'^} , in which case it is zero. 

Given subsets Si, . . . , of $7, we set 

(3.3.3) h{uj,B-L,...,Bg) = mf{h{uj,uj'^,...,uj'^);u^ e 5i, . . . ,a;^ e 5J . 

To control how large h is, we will consider a nonincreasing function 7 from ]0, 1] 
to M"*", and assume that 

(3.3.4) yu en,yBi,...,BgGQ, h{uj, B^,...,Bg) <J2 liKBi)) . 

i<q 
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A typical case where this condition is satisfied is when 

/i(a;,a;\...,a;«) = ^hi(u;') 

i<q 

for functions hi that satisfy the tail condition iJ,{{hi > 7(t)}) < t and when 7 is left 
continuous. Indeed, if t < fi{Bi), then Bi contains a point j/j with hi{yi) < ^(t). 

We consider a convex function ^:]0, 1] — > R"*", and we make the mild technical 
assumption that the inverse function ^ satisfies 

(3.3.5) |^'(^ + 1)|>1|^'(,;)|. 

(We put I rather than ^ simply to allow the case ^{x) — .) 

Theorem 3.3.1. There exists a universal constant K such that for q> K, under 
(3.3.1), (3.3.4), (3.3.5), if, for each s<l, we have 

(3.3.6) / -f-\e{s) - e{t))dX{t) < 

J s 



q\d'is) 



then, for each subsets Ai, . . . ,Aq of , we have 



(3.3.7) J e/'^(^i'-..,^.,a;)(^p(a;) < exp l^e{P{Ai))^ . 

To understand (3.3.6) better, we observe that the term 9'{s) arises simply because 
9{s) — 0{t) resembles (s — t)6'{s) for t close to s. Actually, since 6{s) — 9{t) < 
{s — t)9'{s), change of variable and Lebesgue theorem show that (3.3.6) implies 
that ^~^{u)du < q~^\og{q/ K). In the case where 7 is constant, one can take 
h{uj ^ uj^ ^ . . . uj'^) = g7 whenever uj ^ {u;^, . . . , o;*^} (and otherwise h = 0). Then 
the integral in (3.3.6) has to be interpreted as \{t:s <t; 9{t) > 9{s) — 7}|. When 
9{x) = — logx, this is s{e'^ — 1), and (3.3.6) holds whenever 7 < q~^ log{q/K). We 
then almost recover Theorem 3.1.1. 

To prove Theorem 3.3.1, it suffices, by the induction method, to prove the fol- 
lowing. 

Proposition 3.3.2. There exists a universal K such that, under conditions (3.3.1), 
(3.3.4), (3.3.5), (3.3.6) if, we consider functions {ui)i<q on Q, < Ui < 1, and 
define 



i<q 



(3.3.8) v(u;)= inf V^(wi(a;^)) + /i(a;,a;\ . . . ,a;«) 

then we have 
(3.3.9) 
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Proof. For clarity, we will replace (3.3.6) by 

(3.3.10) J^' ^-\e{s) - e{t))dx{t) < 1^ 

and we will determine in due time a good choice for r. We already assume r < 1. 
The two main parts of the proof are the research of upper bounds for e'^d/i, and 
of lower bounds for ^{Jq Uidji). 

i<.q 

Step 1 For i < q,we set Si = inf 0{ui{u)) — ^(supuj(a;)). By (3.3.8) and (3.3.1), 
taking a;* = oj, we see that if we set /S = ^ 5j, we have 

i<q 

(3.3.11) v{u) < e{ui{u)) + J2Sj = e{ui{u)) + S-Si. 

Step 2 We make the convention that 7(0) = oo. For i < g, we define Sj by 

d{si) = M{^ifiaui>t})) + e{t)}. 

Thus we have 9{si) > Si and for t > Si we have 

(3.3.12) i^{{ui > t}) < ^-\e{si) - e{t)) . 

Step 3 We show that for any subset C of fi, we have 

(3.3.13) / e'^dfi < n{C) exp ( V e{si 

By definition ofs^, given e > 0, we can find ti such that 7(/i(i?i)) +6'(tj) < 6{si)+e, 
where Bi = {ui > ti}. Since 9{ui{ijj'^)) < 9{ti) for cj* e -Bj, we have, by (3.3.8) 

v{u;)<J2^iti) + K^^Bi,...,Bg) 

i<q 

SO that (3.3.13) follows by (3.3.4), since e is arbitrary. 
Step 4 Consider now a number m. We set 

(3.3.14) z = J min{{v -m)+,l)dn andC = {v >m + l} . 

Thus, in particular //(C) < z. 

Using the inequality < 1 + 2x~^ for a; < 1, we get, using (3.3.13) 

/ e"-^d// < f + f <l + 2z + ii{C) exp ( V 9{si) - m ] 
Jq Jq\c Jc \ / 



CONCENTRATION OF MEASURE AND ISOPERIMETRIC INEQUALITIES 45 



SO that 



(3.3.15) J e"-'^diJL < 1 + 2 ^2 + exp e{si) - m j 

Step 5 We now turn to lower bounds for ^ 0(J^ Uidn). For each i < consider 
a number mj, and set 

(3.3.16) Wi{uj) = v{uj) - S ^ Si - e{mi) 

(3.3.17) Wi= ! min(u;+,l)(i//. 

Jq. 



We show that 



(3.3.18) I {ui -mi)dfi< ( ^ " qi^IT' m ) • 

By (3.3.11), we have 

(3.3.19) 0{ui{uj)) > v{ij) - S + Si = Wi{u;) + 9{mi) . 

Now, by (3.3.5), for y > x, we have 

1 

C(y) <^{x) + -^'{x)mm{l,y-x). 

Taking x = 6{mi), y = x + Wi{Lj), combining with (3.3.19), and recaUing that 
C'{0{mi)) = 9'{mi)~^ yields, when Wj(a;) > that 

Ui{u) <mi + — - min(l, Wi{u;)) 

from which (3.3.18) follows by integration. 

Step 6 We take rui = Si. It follows from (3.3.12), (3.3.10) that 



J{ui>Si} \9 {Si)\ 



Combining with (3.3.18), observing that Wi{uj) > implies Ui{uj) < rrii by (3.3.19), 
and using convexity of 9 yield 



(3.3.20) 9{ [ Uid/i) > 9{si) - r + 

Jq. 



3 



We choose the number m of Step 4 as the smallest for which 

q 

card{i < q ; S — Si + 9{si) < m} > - . 
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We observe that if S — Si+9{si) < m, then Wi > z, where Wi is given by (3.3.17) 
and z by (3.3.14). Thus (3.3.20) shows that if we set i? = ^ ^iln Uidn), we have 

i<q 

(3.3.21) Y,0{si)<R + qr-^z. 

i<q 

Combining with (3.3.15) gives 

/ e^-™d/i< 1 + ^(2 + e^+«^-5^-^) 
< 3 + ze''^-5^e^-™ 
Calculus show that supze"^^/^ = 6/qe. Thus if we assume 

z 

(3.3.22) e^^ < ^ 



we have 



2 

For — m > 2, this is < e^~"^, so the proof is finished. 

Step 7 Thus, we only have to consider the case i? < m + 2. By definition of m, 
the set 

I = {i < q \ m < S - Si + d{si)} 
has cardinality > q/2. For i in /, we have 

i2<m + 2<2 + 5 + ^(sj) - Si 

and summation over z e / yields 

(3.3.23) R-S<2+ ^— V(^(sO - ^0 < 2 + - V (^(s^) - 5^ 

cardJ ^-^ q ^-^ 

i&I ^ i<q 

since d{si) — Si >0 for all i < q. On the other hand, (3.3.21) implies that 

J2iO{si)-Si)<R-S + qT 

i<q 

and combining with (3.3.23) yield (for g > 3, r < 1) that 

(3.3.24) J2e{si)- S < {2 + qT)<K + qT. 

i<q ^ ^ 

Step 8 We assume that ? > 3, r < 1, so that (3.3.24) holds, and we finish the 
proof. In Step 5, wc take nii = sup?ii(u;), so that 9{mi) = Si, and Wi = v — S does 
not depend on i. From (3.3.18) and convexity, we get 



e{ I u,dii) >Si + 

Jo. 



W_ 
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where W — Wi — min(l, w~^)dfx. 
We now have by summation that 

qW 

(3.3.25) ^<R-S. 

In Step 4, we take m — so that z — W. From (3.3.15), (3.3.24) we get 

(3.3.26) [ e"-^dfi <1 + 3W exp{K + qr) 

Jo. 

< exp(3W^exp(K + gr)) . 

According to (3.3.25), this is less than exp(i? — 5') provided exp(K + g'r) < 5/9, i.e. 
T < q~^\og{q/ K). Moreover, this requirement imphes (3.3.22). 

The proof is now complete. □ 
3.4. Interpolation 

One can express Proposition 2.1.1 as the fact that, if P{A) > 1/2, then for 
most of the elements x of O^, all but of order \/N coordinates can be copied by 
an element of A. On the other hand. Theorem 3.1.1 asserts that for most of the 
elements x of O^, all but a bounded number of coordinates of x can be copied by 
one of two elements of A. A rather natural question is whether both phenomenon 
can be achieved simultaneously (using the same elements of A). In this section, we 
will show that this is indeed the case. 

This fact seems to be a special case of a rather general phenomenon that can be 
informally formulated as follows: Suppose we have defined two notion of the idea 
"the points x and y are within -C distance S> t" ; we call these I and II respectively. 
Assume that there is good concentration of measure when the fattening Ai of A 
is defined as the collection of points x that are within distance t of A, when the 
meaning of this is defined with respect to notion I (resp. II). Then, in all the cases 
we have considered, it remains true that we have good concentration of measure 
when At is now defined as the collection of points x for which there exists a point y 
which is within distance t oi x with respect of the two notions simultaneously. Two 
specific examples are presented, one in this section, the other in Section 4.5. In both 
sections, we present an inequality, that quantitatively contains two rather separate 
ineqiialities presented before. Considerably more difficult (if at all possible) would 
be the task to find a formulation that would allow to recover sharp forms of these 
two inequalities. This direction of finding inequalities that "merge" several other 
inequalities is very natural. It remains at an embryonic stage. The reason is partly 
the intrinsic difficulty; partly the lack of concrete applications that would help to 
formulate precise needs. 

We now go back to question of finding an inequality encompassing at the same 
time the essence of Proposition 2.1.1 and Theorem 3.1.1. For simplicity, we consider 
only the case g = 2 in Theorem 3.1.1. For two subsets Ai, A2 of O^, x G 
a, t > 0, we set 



(3.4.1) /(yli, A2, a, t, x) = inf{/(y\ y^, a, t, x) ; y^ e A^ , y^ e A2} 
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where 

y^, a, t, x) = acard{z < N ; Xi ^ y} ; Xi yD 

+ teard{z < N ; xi ^ y] or xi y^} . 

Theorem 3.4.1. For each a < log 2, there exists to > such that 

ANt^ 



/4 
P[Ai 



In particular, by Chebyshev inequality, this implies that for u < SNIq, we have 
p(lf A2, a,.[^^,x\>u\\< " ^'^ 



SAT' ) - \ ) - P(^i)P(^2) 

When / (Ai, ^2, a, \J x) < u, by definition, we can find e Ai, G A2 such 
that 



acard{z < N ; Xi ^ {y}, y"^}} + y ^cardji <N-Xi^y] or ^ y^} < u 
so that in particular 

u 



CQxd{i<N-Xi^{ylyi}}< 



a 



card{z < N ; Xi ^ yj or Xj ^ 1/^^} < VSNu . 

We would like to point out that the factor e^-^* in (3.4.2) is not optimal. This 
factor can be improved, in particular, with greater effort on the calculus compu- 
tations of the proof we will present. Further improvement would be possible as in 
Section 1.2, but we have not pursued that direction since it is not clear at the present 
time what would be an optimal quantitative form of the phenomenon described by 
Theorem 3.4.1. 

The key to Theorem 3.4.1 is the following. 

Proposition 3.4.2. Given b < log 2, there exists to > such that, if t < to, for 
any two functions ^1,5^2 < 1 on Q, we have 

(3.4.3) / min e^ , . , . dfi{uj) < - — —— — — . 



Proof. The relatively simple method we present does not yield the optimal de- 
pendence in t in the right hand side of (3.4.3), but it avoids lengthy unpleasant 
computations. Arguing as in the proof of Lemma 3.3.2, we see that 

j hdn J gidn J g2diJ, < exp J {h + gi + g2 - 3)diJ, . 
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Thus, if we set 

K9u92) = mm e , — , — , 

V 9i 92 9i92, 

it suffices to show that for t small enough, and all numbers 5^1, 5^2 < 1, we have 
(3.4.4) %i,^2)+^i+^2 <3 + 4t2. 

Certainly, we can assume gi > g2 and 2t < b. 

Case 1. 5'2 < 5*1 < e*~". In that case 

h{gi. ^72) + 51 + ^72 - 3 < + 2e*-^ - 3 . 

Since < 2, we have + 2e~^ — 3 < 0, so that we can find Iq such that + 
2e*-'' - 3 < if t < to- 

Case 2. g2 < e*~^ < . In that case 

h{9i,92) + gi +92-^< h 5^1 +5^2 - 3 

91 

<e'' + 2e*-^ - 3 

since the function x + ^ jx decreases for a; < 1, and we conclude as above. 

Case 3. e*"^ < 92 < e"*. In that case, using again that the function x + jx 
decreases for a; < 1, we have, since g\ > 5^2, 

e* 

hi9ii92)+9i +5'2-3< \-9i+g2-3 

91 

e* 

< — + 2t/2 - 3 < e^* + 2e"* - 3 

92 

since the function 2x+e*/x is convex, and thus on the interval [e*~^, e~*] is bounded 
by the maximum of its values at the endpoints. Also, we note that e^* + 2e~* — 3 < 
4^2 

Case 4- 92 > e~*. Then 

h{9i,92) +9i+92-^< +9i+g2-^<e^^ + 26"* - 3 

9192 

since, when c > 1, the function x + c/x decreases for a; < 1. We then conclude as 
above. □ 

We will let the reader complete the proof of Theorem 3.4.1 using the induction 
method and Proposition 3.4.2. The basic observation is that, if Bi denotes the 
projection of Ai on O^, we have for x G O^, a; G O, 

f{Ai,A2,a,t, {x,Lu)) <a + t + f{Bi,B2,a,t,x) 

/(Ai,A2,a,t, {x,u)) <t + f{Bi,A2{uj),a,t,x) 

f{Ai, A2, a, t, {x,Lj))<t + /(Ai(cj), B2, a, t, x) 

f{Ai,A2,a,t, {x,(jj)) < f{Ai{(jj),A2{uj),a,t,uj) . 

For the induction hypothesis, one then fixes a < b < log 2, and take to small enough 
that a + tQ <b. 
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4. Convex Hull 

4.1. The basic result 

The main idea of this section is the introduction of a rather different way of 
measuring how far a point x is from a subset A of VL^ . We introduce the set 

Ua{x) = {isi)i<N e {0,1}^ ; 3y e A , Si = ^ Xi = . 

We denote by Va{x) the convex hull of Ua{x), when Ua{x) is seen as a subset 
of R^. Thus Va{x) contains zero if and only if x belongs to A. We denote by 
fc{A, x) the £^ distance from zero to Va{x) (the letter c refers to "convexity"). The 
corresponding notion of "enlargement" of A is as follows: 

(4.1.1) A1 = {xen''; f,{A,x)<t}. 
These notations will be kept throughout the paper. 

Theorem 4.1.1. For every subset A of Q,^ , we have 

(4.1.2) j exp \fl{A,x)dP{x) < 

In particular 

(4.1.3) ^^^*^-^"p(^'"*'^'- 

In order to understand better (4.1.1) it is worthwhile to note the following simple 
result. 

Lemma 4.1.2. The following are equivalent 

(4.1.4) X e A1 

(4.1.5) V(ai)i<jv , 3y e A , ^{ai; Xi Vi} <t j^af . 

i<N y i<N 



Proof. The linear functional a:x ^ ^ cx-iXi on R-^, provided with the Euclidean 

i<N 



2 
i 

i<N 



norm, has a norm ||q:|| = / S Q^f . Since Va{x) contains a point of norm < fc{A, x), 



the infimum of a on Va{x) is < /c(A, x)||a||; but since Va{x) is the convex hull 
of Ua{x), the infimum of a on Ua{x) is the same as the infimum on Va{x). Thus 
(4.1.4) implies (4.1.5). The converse (that is not needed in the paper) follows from 
the Hahn-Banach theorem. □ 



It is very instructive to compare (4.1.3) with (2.1.3). If one takes t = k/vN, 
CKj = 1, one sees that (4.1.3) implies 

P(/(Ax)>fc)<-pi^e-'^V4iv. 
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The only difference with (2.1.3) is the worst numerical coefficient in the exponential. 
But the strength of (4.1.3) is, of course, that all choices of ctj are possible. This 
makes Theorem 4.1.1 a principle of considerable power, as will be demonstrated 
at length in Part II. It docs, however, take some effort to fully understand the 
potential of Theorem 4.1.1. To illustrate one use of Theorem 4.1.1, let us consider 
the case where Q — {0, 1}, and where the probability // gives mass p to 1 (and mass 
1 — p to zero), where p < 1/2. Consider a subset A of (0, 1}-^, and assume that A 
is hereditary, i.e., that if y — {yi)i<N G A, and if {zi)i<N is such that Zi < yi for all 
i, then z & A. Consider x G {0, 1}^, and J = {i < N ; Xj — 1}. Set m(x) = card J. 
Define = 1 if i G J, 0;^ = otherwise. Then Lemma 4.1.2 shows that we can 
find y E A such that 

card{z e J ; Xi ^ yi < fc{A, x)y^m{x)} . 

Since A is hereditary, we have f{A, x) < fc{A, x)^ym{x). 
Thus we have, for all m' 

P{{f{A, .) > t}) < P{{fM, •) > ^}) + P{m{y) > m') 

Since the last term becomes very small for vn! > pN, we recover the correct order 
1/Np of the coefficient of in (2.3.5). 

The key to Theorem 4.1.1 is the following simple lemma. 

Lemma 4.1.3. Consider < r < 1. Then 

(1 — X)"^ 

(4.1.5) inf r-^exp^^ < 2 - r . 

0<A<1 4 



Proof. Taking A = 1 + 21ogr if r > e and A = otherwise, and taking 
logarithms, it suffices to show that 

/(r) = log(2 - r) + logr + (logr)^ > . 

Now /(I) = 0, so it suffices to show that f'{r) < 0. Since /'(I) = 0, it suffices to 
show that {rf'{r)y > 0, or, equivalently, by calculation that (2 — r)~^ — < 0. 
But (2 - r)-2 < 1 < r^^ □ 

We now prove Theorem 4.1.1, by induction upon N. We leave to the reader the 
easy case N = 1. For the induction step from N to N + 1, consider a subset A of 
ri-^+^ and its projection B on ^l^ . For a; G ri, we set as usual 

^(a;) = {xen^; (x,oj) G A}. 

Consider x G O^, u> G z — (x^u). The basic observation is that 

s G Ua(co){x) =^ (s,0) G Ua{z) 
teUB{x)^{t,l)eUA{z). 
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Thus, for s e Va(uj){x), t e Vb{x), < A < 1, we have (As + (1 - A)t, 1 - A) G Va{z). 
The convexity of the function tt — > tt^ shows that 

(4.1.7) f^iAz) < (1 - A)^ + XfAAiuj),x) + (1 - X)fAB,x) . 

The main trick of the proof is to resist the temptation to optimize now over A. By 
Holder's inequality and induction hypothesis, we have 

J exp^f^iA,ix,oj))dPix) 

exp^(l-A)2(/ exp\f!{Aiu;),x)dPix))\f exp\f,{B,x)dP{x))'- 

4 Jqn 4 Jqn 4 



1.. .,2^ 1 \V 1 



<exp-(l-A) 



4' ' \P{A{uj))J \P{B) 



This inequality holds for all < A < 1. Using (4.1.6) with r = P{A{uj)) / P{B) < 1, 
we get 



/ 



exp ^/2(^, (a;,a;))dP(a;) < (^2 - 



P(i?) 



Integrating with respect to uj and using Fubini theorem yields 

/exp>..)a(P«.).^(.-^^) 



~p®^i{A)' ° 



since a; (2 — a;) < 1 for all a; real. 
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4.2. Sharpening 

We now try to improve (4.1.2) by allowing a right hand side for some 

q; > 0. In that case, it will be advantageous to measure the "distance" of s to Va{x) 
by the function 

fa{A x) = inf{s e Va{x) ; ^ ^{a, Sj)} 

i<N 

where 

1 + a — au 



(4.2.1) ^ {a, u) = a{l — u) log(l — u) — {a + 1 — au) log 



1 + a 



The reader should observe right away that fa{A, x) corresponds (with the nota- 
tion of Section 4.1) to fc{A, x) rather than to fc{A, x). This will be the case for all 
the extensions of Theorem 4.1.1 we will consider. 

As pointed out, Lemma 4.1.3 is the key to Theorem 4.1.1. It is a somewhat magic 
fact that when one tries to improve upon Lemma 4.1.3, the best possible function 
that can be used instead of the function (1 — A)^/4 can be computed exactly, leading 
to the formula (4.2.1). 

Lemma 4.2.1. Consider < r < 1. Then 

(4.2.2) inf r-^'*exp£(a,l- A) = 1 + a-ar. 

^ ^ 0<A<1 i'-^v ' / 

Proof. We will not give the shortest possible proof (that consists in checking by 
computation that for A = r{a + l — ar)~^ , we have r~^" exp^(Q;, 1 — A) = 1+a — ar). 
Rather, we will explain how (4.2.2) was discovered. We fix a, and we set f{x) = 
a~^^{a, x). The best choice for A is such that ctlogr + af'{l — A) =0, i.e., 
r = exp(— /'(I — A)). So we would like to have the identity, for < A < 1, 

exp(a/(l - A) + aA/'(l - A)) = 1 + a - aexp(-/'(l - A)) . 

Setting v = 1 — A, and taking logarithms, we want 

af{u) + a{l - u)f'{u) = log(l + a- aexp(-/' (i.))) . 
Differentiating in u and setting g{iy) — exp(— /'(z/)), we get 

a{l - u)f [u) = — — 

1 + a — ag[iy) 

so that §(1/) = ■ Taking logarithms and integrating yields (4.2.1). □ 

Lemma 4.2.2. The function ^ {a, •) is increasing and convex on [0, 1] and ^{a, u) > 



2(a+l) ' 

Proof. Computation shows that ^{a, 0) = ^(a, 0) = 0, and 

d^f a a 

> 



du'^ {a + 1 — au){l — u) a+1 
since u> 0. □ 
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Lemma 4.2.3. For a, a > 0, we have 

(4.2.3) l + a-aa<a 

(4.2.4) a+ (1 - a)exp^(a, 1) < a" 



a 



a 



Proof. To prove (4.2.3), we observe that the graph of the convex function is 
above its tangent at the point a; = 1. To prove (4.2.4), we observe that ^(ct, 1) = 
log(l + a), so that the left hand side is 

a -\- (1 — a)(l -\- a) = 1 -\- a — aa 

and the result follows from (4.2.3). □ 

Theorem 4.2.4. For a subset A offl^, we have 

(4.2.5) J e^pUA,x)dP{x)<^^. 



Proof. It is an obvious adaptation of the proof of Theorem 4.1.1. The case = 1 
follows from (4.2.4), and (4.2.3) is used as a substitute for the last inequality of 
(4.1.8). □ 

If we use Lemma 4.2.2, we see that (4.1.3) can be generalized into 

1 / at"^ 

(4.2.6) P(A=) > 1 - —J- exp 



P(A)« V 2(a + l) 

Optimization over a as in Corollary 2.2.3 yields: 
Corollary 4.2.5. For each subset A of Vl^ , 



(4.2.7) ^ > J2 log ^ ^ PiAl) > 1 - exp I -1 (^t - ^21og-^j 



It is an interesting question whether the term \/2 log 1/P(A) can be removed 
in (4.2.7). We will, however, see in Section 4.3 that the coefficient 1/2 cannot be 
improved. It must be pointed out that Theorem 4.2.4 bring considerably more 
than a simple improvement of the coefficient of t^ in (4.1.3). The reason is that 
i{a,l) = log(Q; + 1) becomes very large when a is large. In that case, (4.2.5) 
recovers certain features of (3.1.2), and in some ways, improves simultaneously 
upon Theorem 3.1.1 and Proposition 2.1.1. To see this, consider g > 1. We fix 
A C ri"^, and for x e fi^, we consider 

k{x) = inf{/c ; 3s e Va{x) ; card{i < iV ; > 1 - ^} < fc} . 
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Then, certainly, we have 



Now, 



k{x)C[q,l-^] <f,{A,x) 



aq,l--] =log--21og ^ 



q I + q 



so that 



(4.2.8) k{x)\og^<f,{A,x). 



On the other hand, by (4.2.5), we have 

P{fg{A,x)>t)< 



P{AY 
so that, by (4.2.8) 



(4.2.9) P{k{x) >k)<^ 



-feiogf / 4\'^ 1 



P{A)i \qj P{A) 



The relationship with (3.1.2) is as follows. 

Ifk{x) < k, we can find a family ijj^^ j<.m of points of and coefficients (q;j)j<^, 

< CKj < 1, Yl, ^3 — 1) that 

(4.2.10) card i i < ; J] - ^ " ^ f - ^ " 

On the other hand, if f{A, . . . , ^, a;) < A;, we can find y-^, . . . , in ^ such that 

(4.2.11) card |^ < ^5 E "^{..M} > ' " ^} " ^ " 

Certainly (4.2.11) is more precise than (4.2.10); however, for some important 
applications (see [T3]) (4.2.10) is just as powerful as (4.2.11). 

4.3. Two point space 

In this section, we consider the case where Q, = {0, 1} and where /j, gives weights 

1 — p to zero and p to 1. The miracle of Lemma 4.2.1 does not seem to happen 
again, so we will only consider statements of the type 

(4.3.1) J e^pUA,x)dP{x) < 
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where, for a couple u = {uq, ui) of positive numbers, we set 

fuiAx) = inf jito ^{s,^; = 0} + tti ^{s'^;xi = 1}: s e VA(a;)| . 

In other words, we take into account the fact that the points and 1 do not play 
the same role. 

If one analyzes the arguments of Sections 2.3, 4.1, 4.2, one sees that the best 
value the induction method allows to take for uq is the largest number s such that, 
whenever a < b, we have 



inf _-__e^^ + ^< 



o<A<i a«(i--^) 6"-^ 6" ~ ((l-p)a + p6)" 

or equivalently 

(4.3.2) (1 -P)— inf x'^^e^'" < — 1 -p 

for aU < a; < 1. 

(The best possible value of u\ is obtained in a similar way, changing p in 1 — 
and will not be considered.) 

The infimum in (4.3.2) is obtained for 

/ a log X 
A = max 0, 



2s 

The left hand side of (4.3.2) is constant for x < exp(— 2s/q;); thereby (4.3.2) holds 
provided, for x > exp{—2s/a), we have 

(4.3.3) (1 -p)e-tf 0°g-)' < — 1 -p 

Determining the best value of s for which this holds is an unpleasant task, so 
we will content ourselves with finding good values of s. Taking logarithms and 
differentiating, one sees that (4.3.3) will hold provided, for a; > 0, we have 

where we have set A = (1 — p)x + p. 
It suffices that for a; > we have 

(4.3.5) __log.>^_^. 

We first consider the case p — ^, and we show that in this case we can take 
s = -TT- Since 

(4.3.6) 1 - < (a + 1)(1 - A) = (a + 1)(1 - p){l - x) 
it suffices to see that 

2(1 -a;) 

0<a;<l=»loga;> ^ . 

1 + a; 

But the function f(x) = logx - ^^^^ satisfies /(I) = 0, f{x) = -(1 - a;)V(l + 
x)^ < 0. Using the notation fc{A,x) of Section 4.1.1, we then have proved the 
following. 
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Theorem 4.3.1. When Q — {0, 1} and /j, is uniform, for each a > 1 and each 
subset A of Vt^ , we have 

(4.3.7) / exp (^fl{A, x)\ dP{x) < ^ 



P{AY 



Compared with (4.2.6), we have gained a factor 2 in the exponent in the special 
case of the two point space. 

CoroUciry 4.3.2. When VL = {0, 1}, and /j, is uniform, for each a > 1, and each 
subset of , we have 



(4.3.8) t > ^/log ^ P{A1) > 1 - exp - ft - Jlog -pl^ 



Proof. From (4.3.7) and Chebyshev inequality, we get 

P(A^t) > 1 - —4^ exp ( —t"^ 

^ - P(A)« ^ V a + 1 

and we optimize over a as in the proof of Corollary 2.2.3. 

It is a natural question whether (4.3.8) can be improved into 

(4.3.9) P{A^) > 1 - Kex.p(-t^) . 

It should, however, be pointed out that the coefficient of is optimal. We will 
now show this, and at the same times, the optimality of the coefficient 1/2 in (4.2.7). 
Provide Q = {0,1} with the probability that gives mass p to 1. Set 

[ i<N 

(Thus, for N large, P{A) is about 1/2.) Consider y e {0, 1}-^, such that card J = m, 
where J = {i < N ; yi = 1}. Assume m > pN. Then any element a; of ^ differs 
of y in at least m — pN of the coordinates indexed by J. Using Lemma 4.1.2 for 
ai = 1/ ^Jm when z e J, ctj = otherwise, we see that 

(4.3.10) /,(A,^)>i^-^^)-"^-^^/^ 




vA?" V m 

If we think of m = vn{y) as a r.v., the central limit theorem shows that, as n ^ cxo, 
(m — pN^ I \fN is asymptotically normal, with standard deviation y^p(l — p). On 
the other hand, \J N/m converges to ^Jl/p in probability. Thus 

hm P(/,(A, .) > t) > _= / exp -- du 

1 f f 



Kt ^\ 2(1 -p) 
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li p — 1/2, the coefficient of is —1; and if we let p arbitrary, we cannot do better 
than the coefficient -1/2 of (4.2.7). 

We now go back to our main line of discussion, and we consider the case p < 1/2; 
we will show that in this case we can take 

x / ct , 1 ct 

(4.3.11) s = min — log 



K p 4:{a + l)p 

In particular, for a large, this is of order 1/p, rather than order log(l/p). This 
remarkable fact is closely connected to Theorem 4.4.1 below. To prove (4.3.11), we 
prove (4.3.5), depending on the value of x. 

Case 1. a: > 1/2. Then - log a; > 1 - a;, 1 - A''+^ < {a + - p){l - x), so 
that it suffices that 

a ^ p{l + a) 
~2i - A ■ 

Now A > -i, so that it suffices that s < 



Case 2. X < ^yp. Then - logo; > — |logl/p, so that it suffices that 



4s \p J (1 -p)A 
Since ^ > p, it suffices that 

a{l-p) , 1 

s < log - . 

4 p 

Case 3. ^/p < x < ^. It then suffices, since — logx > log2, A> {1 —p^^ that 

s < ^ ^ log 2 

which holds when s < log | . 
4.4. Penalties 

We consider now a function h on Q x Q, such that h > 0, and h{uj,uj) = for 
oj E fl. For a subset A of fl^ , and x e Cl^ , we set 

(4.4.1) Ua{x) = {{si) eR'l;3yeA;yi<N,Si> h{xi, y^)} . 

We denote by Va{x) the convex hull of Ua{x). The situation of Section 4.1 corre- 
sponds to the case where h{uj^uj') = 1 ii oj ^ uj' . 

In order to measure the "distance" of zero to Va(x), we consider a convex func- 
tion ip on R, with ■^(0) = 0. We will assume 

a; < 1 ^ V'(^) < ; a; > 1 =^ tjj{x) > x . 
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We set 

fh,^{A,x) = inf < ip{si) ; s = (sj)i<iv e Va{x) > . 

[i<N 

(Thus, the situation of Section 4.1 corresponds to the case •0(s) = s^.) The material 
of this section is connected to that of Section 2.6, and the notations of Section 2.6 
are in force in the present section. Thus 9 denotes a convex function from ]0, 1] to 
M+, with ^(1) = 0, hm 9{x) = oo, and ^ denotes the inverse function. We assume 

that (2.6.1) holds, and assume moreover that for a certain number 7 > 0, we have 

(4.4.2) b>o^\e{b+i)\>^\e{b)\ 

(4.4.3) |^'(1)|>7; w{l/2)>^. 

We recall the function S of (2.6.2), as well as condition H{^,w) of (2.6.3). 

Theorem 4.4.1. Consider a nonincreasing function w on ]0, 1], w < 9. Assume 
that w'^dX < 1, and that condition H{^,w) holds. Assume that for each subset 
B of Ct, we have 

(4.4.4) < ix{B) <- ^ e^pip{h{x,B))dfi{x) < exp«;(//(S)) 

2 Jq, 



(4.4.5) i^{B) >-, t>l^ i^{{x ■ ^l;{h{x, B)) > t}) < e-\l - //(S)) . 
Then, for each subset A of fl^ , we have 

(4.4.6) / exp ^fhAA x)dP{x) < exp 9{P{A)) 

where K depends on 7 only. 

We should observe first that only the values of w{x) for x < 1/2 matter. 

In order to compare Theorem 4.4.1 with Theorems 2.6.5 and 2.7.1, we first have 
to keep in mind that it is the function •0 o /i here that plays the role of h in these the- 
orems. The conclusion of Theorem 4.4.1 is stronger than that of Theorem 2.6.5 (the 
way Theorem 4.1.1 improves on Proposition 2.1.1) but weaker than the conclusion 
of Theorem 2.7.1 (since one takes convex hulls). Condition (4.4.5) strongly resem- 
bles (2.7.2). Condition (4.4.4) coincides with Condition (2.6.12) when ix{B) < 1/2. 
A simple calculation using (4.4.5) shows that for /x(-B) > 1/2, condition (4.4.5) is 
of a somewhat stronger nature than (2.6.12). 

An interesting case where it is worth to spell out (4.4.4) and (4.4.5) is when 
/i(a;, y) = h{y) depends on y only. Denoting by m a median of h, (4.4.5) will hold 
if ip{m) < 1. And, as seen after Theorem 2.6.5, (4.4.4) holds provided w{iJ,{{h > 
t})) > ip{t) (a tail condition of h). 
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To prove Theorem 4.4.1 when = 1, we observe that, since w < 9, (4.4.6) 
follows from (4.4.4) when /x(i?) < 1/2. When /i(-B) > 1/2, a simple computation 
using (4.4.5) shows that given 7, if K is large enough, then 

[ exp ^ij{h{x, B))dn{x) < 1 + 7(1 - /i(5)) < e{n{B)) < exp e{n{B)) 
Jq ^ 

since 9'{1) > 7. 

For the induction step, comparison with the proof of Theorem 4.1.1 shows that 
it suffices to prove the following (used for g — ^(/)). 

Proposition 4.4.2. There exists a constant L, depending on 7 only, with the 
following property. Under the conditions of Theorem 4-4-^} consider a function 
/ > on n. Set 

(4.4.7) fix) = inf (A/(x) + (1 - X)fiy) + Uiil - A)/i(^,2/))) . 

y€ii ,0<X<1 Li 

Then we have 



(4.4.8) 




Understandably, with the level of generality considered here, the proof cannot 
be very short. The reason why we have opted for great generality is that Theorem 
4.4.1 is a principle of considerable power (as will be demonstrated in Chapter 8) and 
that thereby it seems worthwhile to prove extensions of it under weak hypothesis 
on the function h. The proof will incorporate in particular ideas from Theorems 
4.1.1, 2.6.5, 2.7.1. 

A simple idea is that we will need to control 6{J ^(/)(i//) from below. This means 
controlling the lower tail of /. Set Bs = {f < s}, and denote by m a median of /, 
so that n{Bm) > 1/2. We set 

(4.4.9) b= ini \s+yw{fi{B,))] . 

s<m [Li J 

The first step of the proof will be to show that iJ,{Bg) is not too big, i.e. that b 
is not too small. 

Proposition 4.4.3. To prove Proposition 4-4-^} if L > 4/7, we can assume 

4 

(4.4.10) m<6+ — . 

L7 



Proof. We assume m > b, for otherwise there is nothing to prove. Using (4.4.7) 

with A = 0, we see that for each s we have f{x) < s + L~^i(; o h{x,Bs). Using 
(4.4.4) together with H51der's inequality, it follows that 

J e^djjL < exp + -^w(/x(i?s))^ 



CONCENTRATION OF MEASURE AND ISOPERIMETRIC INEQUALITIES 61 



SO that e-^d/j, < by taking the infimum over s <m. On the other hand, (4.4.9) 
imphes 

(4.4.11) s <m^h- s <\w{iJi{Bs)) 

i.e. 

1 



(4.4.12) 



w > b — s 
Ij 



We can hence appeal to Lemma 2.6.4 with C = {/ < 6} and t = 1/L to see that 
But, by (4.4.12) we have 

jjf-b\dfi<^Jwdfi<^{J w^dfi)'/^ < ^ 

and thus 

(4.4.13) X^^"^^"^^ - i^(c)ab) + ^\em . 

On the other hand, when /(a;) > b, we have 
and, by integration, since /i({/ > m}) > 1/2, we have (since m> b) 

I am < (1 - f^ic)m - km - cm) . 

Jn\c ^ 
Combining with (4.4.13) we get 

/ < m + h'm - km - cm) . 

Since we have shown that e-^dfi < e^, there is nothing to prove unless i{f)diJL > 
^(6) (for otherwise 9{J af)diJ,) > b). Thus we can assume 

(4.4.14) lm-arn))<^\em- 

Now, since m > b, from (4.4.2) follows that ^(m) < ^(6) - 7|C'(^)| niin((m - b), 1). 
Comparing with (4.4.14), we see that min(m — 6, 1) < 4:/{L^), so that if L > 4/7, 
we must have m — b < 4:/L^. □ 
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We consider the smallest number a for which 

\/s < m , m — s < aw{fi{Bg)) 

so that 

(4.4.15) Vs<m, \{aw > m - s}\ > iJ,{Bs) 

It is rather important to note that 

8 



(4.4.16) a < 



L72 ■ 

Indeed, if m — s < 8/L7, then 

m — s ^ m — s ^ 8 



w{fi(Bs)) - w(l/2) - L72 

On the other hand, if m — s > 8/L7, then, by (4.4.10), we have m — s < 2{b — s), 
so that 

m — s b — s 2 

w{fi{Bs)) -'^w{fi{Bs)) -L- 

We consider a second parameter M < L. Throughout the rest of this section, we 
will have to put conditions on L, M, L/M. For simplicity we make the convention 
that the expression "If L is large enough"... means "there exists a constant ^^(7), 
depending on 7 only, such that, if L > ^(7)..." and similarly for M, L/M. 

We set m' = m — We consider the function 



/' = min 



and the function g defined as 

9{uj) = f{uj) iff{u)<m' 

g{oj) = max ^m', min ^/(a;), m + j^^^ /('^) > 



Since / < /, it is simple to see that g < f. It is also simple to see that 
(4.4.17) ^(a;) ^ /'(a;) => g{u;) , /'(a;) e 



, 1 

m ,m + — 



Indeed, the right hand side does not occur only when /(a;) < m' , and then f{u!) = 
f{co) = g{u). We set 

C^{f>m}- D = h>m + ^ 
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Lemma 4.4.4. We have 

(4.4.18) e( f af)d^i) > I f'd^l-a^- I {f-mfdfi. 

\Jq J JQ Jc 

Proof. Since /' < /, we have > and 

We now appeal to Lemma 2.6.4 with t = a. We have 
/ e(/')<eM + e'M / {f'-m)dii + a^\^'{m)\+e{m) [ {f'-mfdf,. 

jQ Jo. Jc 

By convexity of d and since ^"{m) < \^'{m)\ this imphes 

e(l ^{f)] >m + [ {f-m)dii-a^- [ {f-mfd^. □ 
\Jq, J Jo, Jc 

Lemma 4.4.5. If L and M are large enough, we have 
{AA.j^y^d^ 

JQ. 

<expQ j {g + f)d^i + 20" + 2 J {f-m)^dfi + J {e'^-^-e^/^)+df?j . 
Proof. First, we observe that 

(4.4.20jf e^-"^d/x< J exp(^min (j - m, d/i + J [e^-"^ - e^'^)+dii . 

We observe that min ^/ — m, < g—m. Since < 1+x+x^ for x < 1/M < 1, 



we have 

(4.4.21) exp (mm - m, ) ] d^ 



< I e^-'^di^ < 1 + / {g- m)di^ + [ {g - mfdii . 
Jci Jci Jci 

Now, by (4.4.17), and provided L, M are large enough, 

(4.4.22) [g - mf < 2{f' - mf + 2[g - f'f < 2{f' - mf + i(/ - ^) . 
We recall also that 

/ {f'-mfd^i< I [f -mfd^i<a^ . 

The result follows by combining these inequalities, and using that 1 + x < e^. □ 

It follows from Lemmas 4.4.4 and 4.4.5 that to prove Proposition 4.4.2, it suffices 
to prove the following when M, L/M are large enough. 

(4.4.23) / (/' - g)dii >6a^ + 6 f (/' - mfd/i + 2 f (e^""^ - e^/^)+d/x • 

JQ, Jc JQ 

This follows from the next three lemmas. 
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Lemma 4.4.6. We have 

(4.4.24) / (e^-"* - e^/^)+di^ < ^I^{D) < / (/' - mfd^i . 

Jo. L L Jq 

Lemma 4.4.7. If L/M is large enough, we have 
Lemma 4.4.8. If L/M is large enough, we have 



[ if - g)dfi > 



K 



Proof of Lemma 4-4-6- The definition of / (with A = 1) shows that /(a;) < m + 
1/M + L-^i;{h{io, n\D)). Thus by (4.4.5) we have 



and thus 



l^[if>m+- + -}]<e-^l,{D) 



I (e^-^ - e^/^)+dfi < J]ei^(e* - 1)6"'=+ V(^) 
•^^ fc>i 



from which the first inequality of (4.4.24) follows by elementary estimates. (The 
second inequality of (4.4.24) is obvious.) □ 

Proof of Lemma ^.^.l. Step 1. For /c > 0, we define 



Ofc = sup |t ; /x({/' > t}) > ^1 



Thus m < Qk < ak+i < m + 1/M. We consider a set Zk C {f < ak} such that 

We set Z'f^ = {uj; h{uj, Z^) < 2}. Since ip{x) > x ior x > 1, we have 

ZiD{uj;iP{hiu,Zk))<2} 
so that by (4.4.5) we have //(Z^) > 1 - l/2e^+^. We set, for A; > 

w^fe = 4 n iZk+2\Zk+i) . 

We observe that the sets {Wk)k>o are disjoint, and that 
(4.4.25) l^{Wk)>^(--^)> ^ 



26^= V e e2 / - 2e'=+3 
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Step 2. We show that 

(4.4.26) / (/' - g)dfi > ^(afc+i - akfi^{Wk\D) . 
JWk ^ 

Consider uj e Wk\D. Then f{uj) = f{u;), so that given A e [0, 1], oj' e Q 

(4.4.27) /'(a;)-/(a;) = /(a;)-/(a;) 

> (1 - A)(/(a;) - fioj')) - - ^')) • 

We can find uj' G Zk such that h{u,uj') < 3. Then f{uj) — f{u;') > ak+i — a^. We 
can take < A < 1 such that 1 — A = M(ak+i — cbk)/^- Then (4.4.27) yields, since 
ip{x) < for a; < 1, that 

/ (w) - /(w) > — (afc+i - Ofc) ^(ofe+i-flfc) . 

Thus, if L/M is large enough, 

M 



f[uS) - fiu) > —{ak+i - ak) 



2 



Thus 



/(c^) < f'ii^) - ^(ofc+i - akf 



Since a^+i — CLk ^ ^/M, and f'{uj) > a^+i, the right-hand side is > > m, so 
that 

g{i^) < f'{i^) - -r(«fc+i - akf 



and thus 



/'(i^) - gii^) > ^(ofc+i - o,kf 



from which (4.4.26) follows by integration. 

Step 3. Denote by /cq the largest integer such that l/4e'^""'"'^ > n{D). Thus 
IJ'iWk) > 2ij,{D) for k < ko, and by (4.4.26) and summation, we get, since iJ,{Wk\D) > 
l^{Wk)/2: 



(4.4.28) f if - g)dfi > ^ E - • 

k<ko 

By the argument of Lemma 2.7.8, we have 

^{ak+i - akfe~'' > ^ j (min(/', m + a^o+i) - m)^<i/x . 
fe<feo 

Thus the proof is completed if a^o+i > 1/2M. 
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Step 4. Assuming now ako+i < 1/2M, we show that 



(4.4.29) J (/' - g)di, > 

Since 

M2 



KM 



fe>fco 

combining with (4.4.28), we get 



f if - g)d^i > ^ - akfe-^ >^ hf-m) 

fe>0 



djjL 



by (the argument of) Lemma 2.7.8, completing the proof of Lemma 4.4.7. 

To prove (4.4.29), we observe that by definition of ko we have e~^°~^ < iJi{D). 
Consider the set 

Z = {V^(/.(.,Zfc„+i))<6}. 

Then, by (4.4.5), we have n{Z) > 1 - e-'^"-^, so that /i(Z f\D)> e'^^jK. Now, 
if a; e D, we have /'(a;) > m + 1/M while if a; ^ D, we have 

K^) < ako+i + m+ -^V(^(^, Zko+i)) 

1 6 

< m H 1 . 

2M L 

Thus g{uj) <m + ^ if L/M is large enough. Hence, f -g> I /AM on ZnD. □ 

Proof of Lemma 4 -4 -8- Step 1. We show that we can assume n{D) < 1/8. Indeed 
otherwise by Lemma 4.4.7 we have /(/' — g)dii > 1/KM and, since a < 8/7^L, 
this is > La^ when L/M is large enough. 

Step 2. By definition of a, there exists s < m with m — s > aw{iJ,{Bs))/2. By 
(4.4.5) and Chebyshev inequality, the set 

H = {'il;{hi;B,))<2 + wifiiB,))} 

has measure > 3/4. Thus if we set G = i7 n {C\D), we have /x(G) > 1/8. 
Step 3. Set 



3 + «;(/x(5,)) • 

Since w{jj,{Bs)) > w{l/2) > 7, and m — s> aw{^{Bs)) /2, we have 

Since //(G) > 1/8, it suffices to show that 

(4.4.30) Va;eG, f\u;) - g{uj)>^p\ 
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Step 4. We prove (4.4.30). Consider lo e G. Then /'(w) — f{uj) > m. Consider 
uj' e Bg with h{ijO,Lo') < 3 + w{n{Bg)). Then 

(4.4.31)/(a;) - /(a;) > sup {{1 - X){m - s) - - X){3 + w{f,{B,)))) . 

0<A<1 ^ 



We choose < A < 1 such that 



1 - A 



4 2 + w(ii{Bs)) 



This is possible since (3 < a < S/L'j'^. Then (4.4.31) yields, since ■ip{x) < for 
a; < 1, that 



/(a;) - /(a;) > 



8 



Thus /(w) < /(ct)) — L7^/3^/8. Since the right hand side is > m', we have g{ijj) < 
f{uj) - The proof is complete. 

4.5. Interpolation 

The result of this section will interpolate between (a weak form of) Theorem 
3.1.1, for q — 2, and (a weak form of) Theorem 4.1.1. Consider three points 

X = {xi)i<N, = {y})i<N, y"^ = {yi)i<N of n^. Set 

ri{x,y^,y^) = (l{^.^yi}, l{3,.^y2}, l{3;.^{j^i^j^2}}) . 
Thus riix,y\y'^) e {0, Ip. Set 

r{x,y\y^) = {n{x,y\y')),<M G ({0,1}^)^. 
Given two subsets Ai, A2 of fl^ , let 

Ua.M^) = Mx, y\ y^) ■,y^ eAi,y'' e A2} , 

and consider the convex hull Vai,A2(^) of ^^^1,^2(^)5 when Uaj^,A2{x) is seen as a 
subset of (M^)^. 

Throughout this section, we set 6 = 1/6, a = log(3 — 2e~^). We make the 
convention to write a point r G (M^)-^ as {ri^i,r2,i,rs,i)i<N- We set 



f{Ai,A2,x) = inf <j arf i + ar\ ^ + fers,^ ; r e 1/^1,^2(3;) 



i<Ar 



Theorem 4.5.1. l^e have 
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To understand better this statement, set u — f{x, Ai, A2). Consider r G Va^,A2 i^) 
such that 

i<N 

Consider numbers (ci,j)i<jv, {c2,i)i<N- Then, for j = 1,2 

1/2 

i<Ar 



Thus 



E + C2,ir2,i + 6r3,i) < t =: w + (^) ^ ( /E^^ + 

i<Ar \\ i<N 



X4. 



i<Ar 



If we recall that Vai,A2(^) is the convex hull of C/ai,A2(^); this implies that we 
can find E Ai, y'^ e A2 such that 

E"t^i.« ! 7^ } + E't'^2,i ; Xi i- yl} + 6card{z ; Xi ^ {y\,yl}} < t . 

The proof of Theorem 4.5.1 goes by induction over N. The case A?" = 1 is left 
to the reader. For the induction from to + 1, one observes, with the usual 
notations, that, when ao,o, oi,0: t*o,i) 0^1,1 ^ 0, are of sum one, then 

f{Ai,A2, {x,uj)) < ao^of{Ai{aj),A2{oj),x) + ai^of{Bi,A2{oj),x) 
+ ao^if{Ai{uj),B2,x) + ai^if{Bi,B2,x) 
+ &(ai,o + + 6(ao,i + ai,i)^ + aai,i . 

Thereby, to perform the induction it suffices to show that, when gi, g2 are two 
functions on Q, gi, g2 < 1, then 

f 111 

(4.5.]/) inf exp(aai,i + 6(ao,i + ai,if + 6(ai,o + 01,1)^)7 wT ^^rrd^i 

J {9i92r°-° g2 ' 9i ' 

^ 1 

~ J 9idfiJ 92dn 

where the infimum is taken over all the allowed choices of ao.o, cio,i, (11,0, 
Lemma 4.5.2. We have 

(4.5.2) inf exp(aai,i + 6(01,0 + ai,i)V 6(00,1 +ai,i)2)^-—^^ ^ ^ 
< V(3-2^i)(3-2^2). 
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We first use (4.5.2) to prove (4.5.1). By (4.5.2) and Cauchy-Schwarz, the left- 
hand side of (4.5.1) is bounded by 

^y"(3 - 2gi)dii J {3- 2g2)diJi ^ ^ {3 - 2 J gidi^){3 - 2 J g2di^) . 

Thus it suffices to observe that for < a; < 1, we have 3 — 2x < x~'^, which expresses 
the fact that the convex function is above its tangent at a; = 1. 

Proof of Lemma 4-5 -2. We will actually restrict the infimum to the cases ai,i = 1 
or ai^i = 0. We will prove 

(4.5.3) min(e- inf ^ e^(«?+«^)^ < ^ {3 - 2g,){3 - 2g2) . 

\ 0<ai+a2<l g^ g^ ^ ) 

We distinguish cases. 
Case 1. g\g2 < e~'^^. 
It suffices to see that 

e'^< V(3-2^i)(3-25r2). 

The right hand side has minimum ciX gi — g2 — ^ and our value of a has been 
chosen so that inequality holds in that case. 

Case 2. gig2 > e"^''. 

For j = 1, 2, we take aj = — ^^^f^- The purpose of the condition gig2 > e~'^^ is 
to ensure ai + a2 < 1. It suffices to show that 

1 (log 91 ) 



J. V'^S 91 f f— 

_e w- < ^3 - 2gi . 

91 

We will show that, for < x < 1, we have 

g-(log.)V26<^2(3_2^) 

or, equivalently that 

(p(x) = (^"S ^) + 2 log X + log(3 - 2x) > 0. 
2b 

Since </''(0) = 0, (/p(0) = it suffices to show that {xip'{x)y > 0, i.e. 

' >0. 



b (3 - 2xy 

But, since | = 6, it suffices to show that x < {3 — 2x)^, which is true since x <1, 
(3 - 2x)^ > 1. □ 



70 



M. TALAGRAND 



5. The Symmetric Group 

We denote by 5" at the group of permutations of {1, . . . A^}. Our interest in the 
symmetric group stems from the fact that it is closely related to a product. To see 
this, let us denote by tij the transposition of i and j. Then, it is easily seen that 
every a & Sn can be written in a unique way as 

(5.1) a = tN,i(N) ° tN-l,i(N-l) • • • O ^2,i(2) 

where, for j < we have < j. This decomposition allows to transfer some 
of the results of Chapter 2 to -^jv. The purpose of the present chapter is to prove 
a version of Theorem 4.1.1 for Sn- The reason for which this is not such an easy 
task is that the decomposition (5.1) is highly noncommutative. 

For a subset A of Sn, and cr e Sn, we set 

UA{a) ^ {s e {0,1}^ ; 3t e A;y£ < N , si = ^ t{£) = a{£)} 
and we consider the convex hull VA(cr) of C/^a(c) in [0, 1]"^. We set 

f{A, a) = inf J si ; s = (s,) G VA{a) 
y£<N 

We denote by P^v the canonical (= homogenous) probability on Sn- 
Theorem 5.1. For every subset A of Sn we have 

(5.2) y^^expl/(^,.)dP„W<^. 

In a natural way, Sn can be considered as a subset of {1,...,A^}^, by the 
map a — ^ {o'{i))i<N- If Sn were equal to all of {1,...,A^}^, (5.2) would be a 
consequence of Theorem 4.1.1; but Sn is only a very small subset of {1, ... , N}^ . 

The challenge of Theorem 5.1 is that it is apparently not possible to prove (5.2) 
by induction over N. Rather, we will use a stronger induction hypothesis. Given 

p < N, we set 



fiA, a,p) = inf i 4 + ^ s| ; s e F^(a) 



> . 



Theorem 5.1 is obviously a consequence of the following. 

Proposition 5.2. For each subset A of Sn and each p < N, we have 

[I 1 
(5-3)iv / exp— /(A,(j,p)dPiv((T) < p , 

JSn r'NKA) 

(5.4)iv / exp -^/(A, a, a-\p))dPN{a) < . 

Jsn -nv(^) 
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We let to the reader to prove Proposition 5.2 when = 1. We now assume that 
Proposition 5.2 has been proved for N and we prove it for A?" + 1. A noticeable 
feature of this proof is that the proof of (5.3)jv+i (resp. (5.4)jv+i) will require the 
use of (5.4)7v (resp. (5. 3) at). Before the proof starts, we need to introduce some 
notation. Given pjUi < N , p ^ m, we set 



We start the proof of (5.4) w+i. Certainly there is no loss of generality to assume 
that p = N+1. 

Lemma 5.3. Consider i,j<N + l,i^j,ae Sn+i, < A < 1. Then 



(5.7) /(A, a, z) < 4(1 - A)2 + (1 - X)g{A, a, z, j) + Xf{A, a, j, i) . 

Proof. Consider s e Va(c), ^ € VA{o^), with U = 0. By convexity of VA{o^), we have 

u = {1 - X)s + Xt e Vk(cT) . 



(5.5) 




Given i,j<N,weset 



(5.6) 




Thus 



f{A,a,i)<J2^j + 2ul 



Since Si < 1 and since ti — 0, we have 



fiA, a,i)<Y, + 2(1 - A)' + ((1 - X)sj + Xtjf . 



Since Sj < 1, we have 



((1 - X)sj + Xtjf < 2(1 - Xfs] + 2XH] < 2(1 - Xf + 2Xt] . 



Since the function x 



is convex, we have 



uj<{l-X)sj+Xtj. 



Thus we have 



fiA, a, i)<il-X)J2sj + x[ 2t] + + 4(1 - A)^ . 
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The result follows by taking the infimum over s,t. □ 

Following the idea of Theorem 4.1.1, (5.7) will be used together with Holder's 
inequality. Some work is, however, needed to relate the resulting terms to the 
induction hypothesis. For i < iV + 1, we set 

Gi = {ae Sn+1 ; (J{i) = N + 1}. 

For simplicity, we denote by ti — tN+i,i the transposition of + 1 and i. We 
consider the map R: p ^ p^U. We observe that, if p e G^, then 

R{p){N + 1) = p o U{N + 1) = p(z) =N + 1. 

Thereby, we can consider i? as a map from Gi to Sn- We set Ai = AoGi. 

Lemma 5.4. If a E Gi, we have 

(5.8) f{A, a, J, t) < f{R{Ai), R{a), . 

Proof. We let the reader consider the essentially obvious case where i = N + 1, 
and we assume i ^ N + 1. Given a sequence s e {0,1}^, we consider the sequence 
s — (si) e {0, l}-^"'"-'- defined by Si = 0, sat+i = Si, si = si ii £ ^ N + 1. 
We note that sg = St-(^i-j for i ^ i. Thus it suffices to prove that s G Ua{o') 
whenever s G [/ij(^.)(i?(a")). Consider s G Uji(^Ai){R{cr))- By definition, there exists 
T G R{Ai) such that, for £ < iV 

si = o^T{e)^R{a){e). 

Since r G R{Ai), we have r = R{p) for a certain p E Ai. Thus 

(5.9) se = 0^ p{U{e)) = a{U{e)). 
We will show that, for £ < AT + 1 

Si = 0^ p{e) - ail) . 

This holds for i = i, since p{i) = (T{i) = N + 1. For £ i, this follows from (5.9), 
since S£ — and ti o ti is the identity of Sn- □ 

We denote by Qi the uniform probability on Gi. 

Corollary 5.5. 

(5.10) J e^p^f{A,a,j,i)dQi{a) < — ^ 



1 



Qi{Ai) Qi{A) • 

Proof. Using (5.8), the left-hand side of (5.10) is bounded by 

exp^f{R{Ai),R{a),ti{j))dQi{a) = J exp ^f{R{Ai), p,ti{j))dPN{p) 



< 



PNiRiAi)) QiiA) 

using (5.3)Ar. □ 
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Lemma 5.6. Assume j ^ i- Then 

f 1 1 

(5.11) J exp Y^giA cT, i, j)dQi{a) < g-^y • 

Proof. The map S: p — > p o tij is one to one from Gj to Gi. We will prove that 
setting B = R{S{Aj)), we have 

(5.12) g{A,a,i,j)<fiB,R{a)) 

where we recall that R is seen as a map from Gi to S^. Since Pjs[{B) = Qj{A), 
(5.11) will follow from either (5.3) at or (5.4) tv as in the proof of Corollary 5.5. 

Given a sequence s G {0, 1}^, we consider the sequence s G {0, 1}^+^ defined as 
follows. We set Si = Sj — 1. If A'" + 1 ^ i, j, we set sn+i — Si. If £ ^ {i, j, N + 1}, 
we set S£ — S£. 

We will show that when s G Ub{R{o')), then s G Ua{o'). By definition of 
UB{R{(y)), there exists t E B such that 

= =^ t(£) = R{a){e) = ao ti{l) . 

Since T E B, we can write t = potij oti, where p E Aj. Thus 

Si = ^ potij o ti{£) = aoti{£) . 

We will show that ior £ < N + 1 we have 

se = 0^ p{£) = a{£) . 

The only nontrivial case is £ = iV + 1, when N -{- 1 ^ i, j . In that case, when 
^iv+i = 0, we have si — 0, so that T{i) — R{a){i) — a{N + 1). But 

t(z) = po tij o ti(i) =po tij(N + 1) = piN + 1) . 

since N + 1 ^ □ 

We now complete the proof of (5.4)jv+i. We select j such that Qj{A) is maxi- 
mum. If z < + 1, i 7^ j, for < A < 1, we have, using Lemmas 5.3, 5.4, Corollary 
5.5 and Holder's inequality 



exp ^/(^) i)Qi{(T) < exp 



i(A-l)^ 



1 



QiiA)^ Qj{A) 



l-A 



QM)Y^ 1.. ,.2 

' exp 7(1 — A) . 



QjiA)\Qj{A)J "4 
If we appeal to Lemma 4.1.3, we have 

(5.13) |expi/(A..,W.W<^(2-|M) 
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It should be obvious from the induction hypothesis that (5.13) stiU hold for i — j. 
Since Pn+i = J2i<N+i N^Qi^ have, from (5.13), and since i = a~^{N + 1) for 
a e Gi that 



J exp^fiA,a,a-\N+l))dPM+iia) < (^2 



Pn+i{A) 



-Pn+i{A)- ° 

Having proved (5.4)jv+i, we turn towards the proof of (5.3)Ar_|_i. We can assume 
again p = N + 1. The proof is not identical to that of (5.4)Ar_|_i, but is completely 
parallel. 

Lemma 5.7. For a e Sn+i, j < N + 1, j ^ a{N + 1), < X < 1, we have 

{5.mA,a,N+l) 

< 4(1 - Xf + (1 - A)^(^, a,N+l, a-\j)) + Xf{A, a, a-\j),N + 1) . 

Proof. This is (5.7) if one replaces i by + 1, j by cr~^(j). □ 
We set 

G', = {aeSN+i;a{N+l) = i}. 

We fix i, and we consider the map R':p ti o p. Thus, for p e G[, we have 
R'{p){N + 1) = ti{i) = N + 1, and we can view R' as a map from G[ to Sn- We 
set A'i = AnG[. 

Lemma 5.8. If a E G'^, i ^ j, we have 

(5.15) /(A ^, CT-\3). N + l)< f{R'{A',), R'{a), R'{a)-\um . 

Proof. Given a sequence s G {0,1}^, we consider the sequence s = {si) G {0, 1}-^+^ 
defined by = if £ 7^ + 1, and sn+i = 0. Since a~^{j) = R' {a)~^{ti{j)) ^ 
N + 1, it suffices to prove that s e [/a(c) whenever s G Uji'(A'.){R'{cr)). Thus, 
consider s in this later set. By definition, there exists r G R'{A'-) such that 

v£ < iv , = =^ T{e) = R'{<j){t) . 

Since T G R'{A'^}, we have r = R'{p), p G A[. Thus, 

Ve<N, se = 0^tiO p{e) =tiO a{£) => p{e) = a{e) . 
Since p{N + 1) = o-{N + 1) = i, we then have 

V£<N + l,Si = 0^ p{£) = a{e) . 
Thus s G C/a(ct). □ 
We denote by the homogeneous probability on G'^. 
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Corollary 5.9. If j ^ i, 



(5.16) J exp ^f{A, a, a-\j), N + l)dQ',{a) < ^ 



(A) 



Proof. Using (5.15) and the fact that R' transports Q[ to Pjv, the left-hand side of 
(5.16) is bounded by 



J exp ^f{R'{Ai),p,p-\u{j)))dPN{p) 



1 1 

< 



Pn{R'{A,)) Q'M) 
using (5. 4) AT- □ 
Lemma 5.10. If i ^ j, we have 

(5.17) J exp ^g{A, a, N + 1, a-\j))dQ[{a) < . 



Proof. The map S': p ^ tij o p is one to one from G'j to G[. We will prove that, 
setting B = R' o S'{Aj), we have, for a in G'^ that 

(5.18) g{A, a,N+l, a-\j)) < f{B, R'{a)) 

where we recall that R' is seen as a map from G'^ to S^- Since Pn{B) = Qj (A), 
(5.17) will then follow from either (5.3)jv or (5.4)jv. 

Given a sequence s e {0, 1}^, we consider the sequence s e {0, 1}^+^ defined 
as follows. We set sn+i = Scr-i(j) = 1. We set S£ = se ii £ ^ {N + l,a~^{j)}. 
To prove (5.18) it suffices to prove that if s G UB{R'{(y)), then s e Ua{o')- Thus, 
consider s e UB{R'{cr)). By definition, there exists t e B such that 

(5.19) se = 0^ r{t) = R'{a) {£) = U o a{£) . 

Since T e 5, we can write r = o t^j o p, where p e A'j. Thus, by (5.19) 

si^O^ tij o p{£) = a{£) ^ p{£) = tij o a{£) . 

Now, for £ ^ N + 1, a-^{j), we have a{£) ^ ij; thus tij o a{£) = a{£). Thus for 
these values of £ we have 

Si^O^ Si^O^ p{£) = a{£) . □ 



The end of the proof of (5.3)jv+i is similar to the end of the proof of (5.4)jv+i, 
and is left to the reader. 
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6. Bin Packing 

Given a collection xi, . . . , xn of items, of sizes < 1, the bin packing problem 
requires finding the minimum number Bn[xi, . . . , Xn) of unit size bins in which 
the items xi, . . . , xn can be packed, subject to the restriction that the sum of the 
sizes of items attributed to a given bin cannot exceed one. (For simplicity, we will 
denote items and item sizes by the same letters.) The bin packing problem is a 
fundamental question of computer science, and, accordingly, has received consider- 
able attention. Much work has been done on stochastic models [C-L] . In the model 
we will consider, the items Xi, . . . , Xn are independently distributed according to 
a given distribution One of the natural questions that arises is the study of 
the fluctuations of the random variable Bn{Xi, . . . , X^). One early result, [R-Tl], 
[McDl], using martingales, is that for alH > 0, one has 

(6.1) P{\BN{Xu...,XN)-EBN{Xi,...,XN)\>t)<2exp 

However, especially when EXi is small, one expects that the behavior of B]sf{Xi, . . . , Xtv) 
resembles the behavior of ^ Xi. Thereby one should expect that the exponent 

i<N 

in the right-hand side of (6.1) should be of order t^/A^var(Xi), or, at least, less 
ambitiously, t^/NE{Xf). This is apparently not so easy to prove, and despite sev- 
eral attempts, was established only recently using non-trivial bin-packing theory 
[Rl], [R2], [R3]. The purpose of the present section is to prove this result as an 
application of Theorem 4.1.1. Several features of the proof will appear repeatedly 
in future applications. One advantage of our approach is that it uses only trivial 
facts about bin packing, such as the following observation. 

Lemma 6.1. We have 

Bn{xi,..., Xn) < 2 -H . 

i<N 




Proof. It suffices to construct a packing in which at most one bin is less than half 
full. Such a packing exists since bins that are less than half full can be merged. □ 

We take fl = [0, 1]. For a subset A of fi^, and x e O^, we recall the notation 
fc{A,x) introduced in Section 4.1. For x = {xi, . . . ,xn) £ , we write simply 

^ 1/2 

Bn{x) rather than Bn{xi, . . . ,xn)- For x e , we set ||a;||2 — { J2 



I 



Finally, for a > 0, we set 

Aia) = {yen'';BN{y)<a}. 
The crucial observation is as follows. 
Lemma 6.2. For all x e O,^ , we have 
(6.2) Bn{x) <a + 2||a;||2/c(^(a), x) + 1 . 
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Proof. As follows from Lemma 4.1.2, (taking ctj there equal to Xi) we can find 
y e A{a) such that, if / denotes the set of indices i < N ior which Xi = j/j, we have 

^Xi< ||a;||2/c(^(o),a;) . 

By Lemma 6.2 the items {xi)i^i can be packed using at most 2||a;||2/c(^(a)5 a^) + 1 
bins. The items (xj)jg7 are exactly the items (yi)^^/, so they can certainly be 
packed using at most a bins, since y e B{a). The result follows. □ 

We provide [0, 1] with the measure fi, and we denote by P the product proba- 
bility on n^. The term ||a;||2 of (6.2) will be disposed of by the following simple 
observation. 

Lemma 6.3. We have 

(6.3) ^^(||a;||2 > 2^/N{EXfy/^) < exp{-2NEXf) . 



Proof. Since < 1 + 2a; for a; < 1, we have 
so that 



E exp <1 + 2EXf < exp 2^X1^ 



£;exp ^ Xf < exp2NEXl 



.i<N 

for which (6.3) follows by Chebyshev inequality. □ 

We can now prove the basic inequality. 

Proposition 6.4. We have, for all t > 0, all a > that 

{6.4)P{BNix) < a)P{BN{x) > a + 2ty/N{EXlf/'^ + 1) < e"*'/^ + g-^^^^i . 

Proof. Indeed, by (6.2), iIBn{x) > a+2t^/N{EXfy/^+l, we have either fc{A{a),x) > 
t or ||a;||2 > 2y/N{EXiy/^. The result then follows from (4.1.2) and (6.2). 

Theorem 6.5. Denote by M a median of Bn{x). Then for all u < A\/2NEXl 
we have 

P{\Bn{Xi,...,Xn)-M\ >1 + u)< 8exp ' 



IGNEXfJ ' 

Proof. First, we take a = M to obtain from (6.4) setting u — 2t\fN[EXlY/'^ , since 
P{Bn <M)> 1/2, 

P{Bn >M + u + l)< 2(e-*'/4 + e-2^-^^?) 

< 4e-*'/^ . 

The bound for P{Bn < M — u — 1) follows similarly taking a — M — u — 1. □ 



Remarks. 1) One can also get bounds for larger values of tt, by adapting Lemma 
6.3. 

2) It is instructive to find an alternate proof of Theorem 6.5 using Corollary 2.2.4 
rather than Theorem 4.1.1. 
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7. Subsequences 

7.1. The longest increasing subsequence 

Consider points Xi, . . . , X]y of [0, 1]. Wc denote by Lj^{xi, . . . , xtv) the length of 
the longest increasing subsequence of xi, . . . , xat. That is, the largest integer p such 
that we can find ii <■■■< ip for which < • ■ • < Xi^. It is simple to see that 
when Xi, . . . , Xn are independent uniformly distributed over [0,1] (or, actually, 
distributed according to any non atomic probability), the r.v. Ln{Xi, . . . ,Xn) is 
distributed like the longest increasing subsequence of a random permutation a of 
{1, . . . , A^} (where the symmetric group Sn is of course provided with the uniform 
probability). The concentration of Ln{Xi, . . . , Xn) around its mean has been 
studied in particular in [F] and [B-B]. Sharper results will be obtained here as a 
simple consequence of Theorem 4.1.1. We consider Q = [0, 1]-^. For x = {xi)i<N in 
Q, we set Ln{x) = Ln{xi, . . . , xn)- For a > 0, we set 

A{a) = {x eft ■,Ln{x) < a} . 

The basic observation is as follows. 

Lemma 7.1.1. For all x e fl^ , we have 

(7.1.1) a > Ln{x) - fc{A{a),x)./lMx). 

In particular, 

(7.1.2) LN{x)>a + v^fc{A{a),x)>—^=. 



Proof. For simplicity, we write h = L]y{x). By definition, we can find a subset / of 
{1, . . . , N} of cardinality b such that if i,j G /, i < j, then Xi < xj. By Lemma 
4.1.2 (taking ctj = 1 if z e / and ctj = otherwise), there exists y e A{a) such 
that cardJ < fc{A{a), x)Vb, where J = {i E I ; yi ^ Xi}. Thus (a^i)ig/\j is an 
increasing subsequence of y; since y e A{a), we have card(/\J) < a, which proves 
(7.1.1). 

To prove (7.1.2), we observe that by (7.1.1) we have 

\/Ln{x) 

and that the function u ^ {u — a)/y/u increases for u > a. □ 

We denote by M {— M^) a median of L^. 
Theorem 7.1.2. For all u > we have 

(7.1.3) P(Ljv > M + w) < 2exp- 



4(M + u) 

(7.1.4) P(Ljv < M-«) < 2exp- 



«2 



4M 
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Proof. To prove (7.1.3), we combine (7.1.2) with M — a and (4.1.2). To prove 
(7.1.4), we use (7.1.2) with a = M — u, v = u to see that 

u 

Ln{x) > M =^ fc{A{M - u), x) > 



/M 

so that 

(7.1.5) p(^f^(A{M-u),x)>-^^ >^ 

On the other hand, by (4.4.2), 



(7.1.6) P(fJA(M-u),x)> ^] <——^ -e-fM . 

^ ^ ^ ^ ' - x/m; - P{A{M-u)) 



„2 



Comparing (7.1.5), (7.1.6) gives the required bound on P{A{M — u)). □ 

It seems worthwhile to state an abstract version of Theorem 7.1.2. Let us say 
that a function L^'- — > N is a configuration function provided it has the following 
property. 

(7.1.7) Given any x = {xi)i<N in ^^^5 there exists a subset J of {1, ... , N} with 
cardJ = Ln{x) such that, for each y in , we have Liq{y) > card{z & J ; yi = Xi}. 

The reason for this name is that, intuitively, Ln counts the size of the largest 
"configuration" formed by the points Xi. 

The proof of the following is identical to that of Theorem 7.1.2. 
Theorem 7.1.3. If Ln is a configuration function, then (7.1.3) and (7.1.4) hold. 
7.2. Longest common subsequence 

Consider two sequences x — {xi, . . . ,xn), y = (j/i, • • • , J/at) of numbers. We 
define the length LN,M{x;y) of the longest common subsequence of the 
largest integer p for which there exists 1 < ii < ■ ■ ■ < ip < N and 1 < ji < 
• • • < jp < N' such that a^j^ = yj^ for each £ < p. One interpretation of this 
is when chosen among a (small) finite number of possibilities (the 

letters of an alphabet) L]sf^N'{x;y) is then the length of the longest "subword" 
of the words a;, y (and N + N' — L^^^'^x^y) is the so-called "edit distance" of 
the two words). These considerations arise in a number of situations, such as 
genetics, speech recognition, etc. Consider now a r.v. X, and two independent 
sequences [Xi)i<_N, 0^j)j<N' independently distributed like X. We are interested 
in the random variable Ln^n' = Ln,n' (-^i, • • • , -^at ; i^i, • • • , ^at')- 

Theorem 7.2.1. Consider a median M (= Mn,n' ) of Ln,n' ■ Then for all tt > 0, 

we have 



(7.2.1) P(Ljv,iv' > M + w) < 2exp 



(7.2.2) P(Liv,Ar' < M-u) < 2exp 



»2 



32(M + 'u) 

.,2 



32M j 
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Comments. It is known that lim E{Ln^n)/N exists. However, this Hmit can be 

very small, in the case where X takes many possible values. In this case, we have 
M <^ N, and (7.2.1), (7.2.2) give a better result than Azuma's inequality. 

Proof. The proof is very similar to the proof of Theorem 7.1.2. Consider Q. = [0, 1], 
and for x e , consider 

L{x) = Ln^n'{xi, . . . , xn ; xn+1, ■ ■ ■ , xn+n') ■ 

Consider the set 

A{a) = {x ; L{x) < a} . 

The basic inequality is that 

(7.2.3) a > L{x) - 2^2 fc{A{a),x)^/L{x) . 



To see this, we set b = L{x); we can find indices 

l<zi<---<Z6<iV< Zb+i <■■■ <i2b<N + N' 

such that Xi^ — x^^,^^ for 1 < /c < 6. Consider the set / = {ik ', 1 < k < 2b}. By 
Lemma 4.4.2, we can find y e A{a) such that 

(7.2.4) card{z e I ; Xi ^ y^} < fc{A{a), x)V2b . 

Consider then 

J = {k <b; Xij^ = yi^ ; Xi^^^ = yik_^_i,} ■ 

By (7.2.4) we see that 

cardJ >b - 2card{i e I ; Xi yi} > b - 2V2bfc{A{a),x) . 

On the other hand, L{y) > card J since, for k & J, we have yi^ = yi^+b- Also, since 
y G A(a), we have L{y) < a. Condition (7.2.3) follows. The rest of the proof is 
identical to that of Theorem 7.2.2. □ 



Remark. It is also possible to find a more general version of Theorem 7.1.3 that 
would contain Theorem 7.1.2. 
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8. Infimum and Percolation 

Consider an independent sequence {Xi)i<N of positive r.v. Consider a family 
of A^-tuple a — {ai)i<N of positive numbers. Our prime topic of interest in the 
present section is the random variable 

(8.1) Z' = Z'j.= inf V aiXi . 

i<N 

It does matter a lot that we take an infimum rather than a supremum. The function 
of the ' in Z' is to indicate that we take such an infimum. Rather that (8.1) one 
can also write 

Z' = - sup ^ {-ai)Xi 

but the numbers — ctj are negative. In Section 13, we will have to study the r.v. 

(8.2) Z = sup ^ aiXi 

where and Xi can possibly have any signs. In order to avoid repetition, we will 
study the variables Z given by (8.2). 

8.1. The basic result 

Consider a family T of n-tuples ex. = {ai)i<N. We make no assumption on 

f V' 

the sign of ai. We set a = sup ||q;||2, where ||q;||2 = I j . We consider 

ae:F \i<N J 

independent r.v. Xj, and we assume that for each i there is number such that 

ri<Xi<ri + 1. 

Theorem 8.1.1. Consider the r.v. Z given by (8.2), and a median M of Z. Then, 
for all u> 0, we have 

(8.1.1) P{\Z - M\>u) <4exp(^-^^ . 

Proof. This will again follow from Theorem 4.1.1. 

Step 1. Set O, = [0, 1], and for x = {xi)i<N G fl^ , set 

Z{x) = sup ^ ai{ri + Xi) . 

Consider a e R, and A{a) = {y E fl^ ; Z{y) < a}. The basic observation is that 

(8.1.2) Vx e , Z(x) <a + afc{A{a),x) . 

To prove this, consider a E J-'. By Lemma 4.1.2, we can find y e A{x) such that, if 
I = {i < N ; yi ^ Xi}, then we have 

(8.1.3) < ||«||2/c(^(a),x) < a/e(^(a),a;). 
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< X] \yi -Xi\<^ \ai 

iel i€l 



We then have 

^ ai{ri + yi)-^ ai{ri + Xi) 

i<N i<N 

Thus, by (8.1.3) 

^ aiin + Xi) < Z{y) + afc{A{a),x) <a + afc{A{a),x) , 

i<N 

and taking the supremum over a proves (8.1.2). 

Step 2. We provide the i*^ factor [0, 1] with the law Hi of — ri. We denote 
by P the product probability. Thus by (8.1.2) and (4.1.2) 

P{Z{x) >h)<P f /e(A(a), a;) > < ,\ exp ^ ~ ""^^ 



I.e., 



P{Z{x) > h)P{Z{x) <a)< exp 



P{A{a)) 

(b - a)2 



4cr^ 



from which (8.1.1) follows as in Chapter 7, since law of Z{x) under P coincides 
with the law oi Z. □ 

8.2. General moments 

In the present section we rely on the theory of Section 4.4. We start with 
some preliminaries. Consider a convex function on R"*" that satisfies (4.4.2) and 
ip{0) = 0. Consider a family of A'"-tuples as in Section 8.1. For it > 0, we define 



i/jj^iu) = inf < ij{si) ;3aeT, Si\ai\ > 



u 



i<N 



i<N 



The simplest case is when 'ip{x) = x^ . In that case it is easily seen that V'J^(m) = 
v? /a'^^ where cr^ = sup{||Q;||2 ; a G T}. The most interesting case is arguably the 
case where = V'o is given by 

ipoix) = if a; < 1 ; ipoix) = 2a; — 1 if a; > 1 . 

If we set 

T = sup{|Q!j| ; i < N ; a E , 

we note that, for given a & for each s = (sj)i<jv, setting J = {i < N ; Si < 1} 
we have 

Si\ai\ = ^ Si\ai\ + ^ Si\ai\ 

i<N ieJ i^J 

\i€J / i^J 
1/2 



,i<N 



i<N 
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Thus, if ^i<NSi\ai\ > u, then either YliKN'^i^i) ^ w^/4<t^, or else ^ •(/'(•Si) > 

~ i<N 

t/2, and thus 

The basic observation is as foUows. 

Proposition 8.2.1. Consider T , i/j as above. Set O = M, and consider the function 
Z{x) = sup J2 Oii^i- Consider a e M, and A{a) = {y ; Z{y) < a}. Then 

aeJ^i<N 

(8.2.2) e , U,^{A{a),x) > *^(Z(a;) - a) 
when the function h is defined onM. xM. by 

(8.2.3) h{uj,uj') = \u -u'\. 

Moreover, when ai <0 for each i < N and each a E J-', we can take 

(8.2.4) h{uj,u;') = {u;' -u;)+ . 

Proof. By definition of fh,ii), given e > 0, we can find s e VA(a){^) such that 

i^{s^) < fh,^{A{a),x) + s. 

i<N 

Consider a = (a^) e J^. Then there exists s' e UA{a){x) such that < 

i<N 

^ Ictjlsj. This means that there is y e A{a) for which J2 \<^i\h{xi, yi) < ^ |Q!i|si, 
i<N iei i<N 

where I = {i < N ; Xi ^ j/i}. Now 

(8.2.5) aiXi = ^ aiyi + ^ ai{xi - yi) . 

i<N i<N i£l 

We have ai{xi-yi) < \ai\ \xi-yi\. If cci is < we have ai{xi~yi) < \ai\{yi-Xi)^ . 
Thus in aU cases under consideration, we have 



Yai{xi - yi) <Y \ai\h{xi,yi) < ^ \ai\s 

i€l iel i<N 

SO that, by (8.2.5) 

o^iXi < ^ aiyi + X] \(Xi\si 

i<N i<N i<N 

i<N 

Taking the sup over a yields 



ai s 



1 1"-"* 



sup \(Xi\si > Z{x) — a 
and the result follows by definition oi'^j^. □ 
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Corollary 8.2.2. Consider a family T of N -tuples a — {ai)i<N- Consider a se- 
quence of independent r.v. {Xi)i^]sr with common law /i. Assume that (4-4-6) holds 
(for a certain function 9) when P = /i^^ , and where h is the function determined 
in Proposition 8.2.1. Then the r.v. Z = sup ^ OiiXi satisfies 

(8.2.6) w > ^ P{Z > M + w) < exp (6' ( M - 4*^(u) 



^ 2J K 

(8.2.7) « > ^ P{Z <M-u)<i(^^^jr{u)-\og2 

where M is a median of Z . 

Proof. Using (4.4.6) and Chebyshev inequality, we have 

P{fhAMo):X) >t)< exp [e{P{A{x))) - ^ 

where A{a) is the set of Proposition 8.2.1; thus, by (8.2.2) setting t = ^^^^(6 — a), 
for Z{x) > b we have fh,^{A{a), x) > t, so that 

P{Z >b)< exp (^9{P{Z < a)) - -^*^(6 - a)^ . 



Taking a = M, b = M -\-u imply (8.2.6). Taking b = M, a = M -u imply 

^ < exp (9{P{Z <M-u))- -^*^(w)) 

from which (8.2.7) follows. □ 
We now go back to our main line of study, that of the r.v. Z' = sup ^ {—ai)Xi. 

aeJ^ i<N 

In order to apply Corollary 8.2.2, we need (4.4.6) for the penalty function h{x, y) = 
{y — x)'^ . Since Xi is positive, its law /i is supported by M"*". Thereby, only the 
properties of h on M"*" x ]R+ matter; but then [y — x)^ < y. Thus, to have (4.4.6) 
it suffices that the function h{x,y) = y satisfies the conditions of Theorem 4.4.1. 
The case where the function h{x,y) depends on y only has been discussed after 
Theorem 4.4.1. Thus, we have proved the following. 

Theorem 8.2.3. Consider a family T on N -tuples of positive numbers, and inde- 
pendent identically distributed nonnegative r.v. variables {Xi)i<N. Consider func- 
tions 9, ^, w as in Theorem 4--4--1- Assume that (2.6.1), (4-. 4-. 2), (4.4-3) hold. 
Assume that condition H{$^, w) holds. Assume that the median m of X\ is < 1, and 
that for t>m, we have 



(8.2.8) 



w{P{X^ > t)) > i;{t) . 
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Then if M is a median of Z' — inf ^ oiiXi, the following holds (where the 

^ i<N 

constant K depends only on the parameter of ^ of Theorem 

(8.2.9) u>0^ P{Z' <M -u)<exp (^9 - -^*^(w)^ 

(8.2.10) u>0^ P{Z' >M + u)<^ (^-^*^(u) - log2^ . 



Comment. A striking feature of this result is the different forms of (8.2.9) and 

(8.2.10) . This phenomenon is weU known in the case where consists of single 
point a. In that case, is a sum of positive independent r.v. Yi. The lower tails 
of Z have a tendency to be "subgaussian" ([H]) while the upper tails of Z certainly 
depend much on the upper tails of the variables Y^. 

Corollary 8.2.4. There exists a universal constant U with the following property. 
Assume that satisfies (4-4-2)- Assume that 

(8.2.11) > 1 , P{Xi >t) < exp(-2V'(t)) . 
Then we have 

(8.2.12) u>0^ P{\Z -M\>u)< 3exp (^-^^j^{u)^ . 



Proof. We take ^{x) = e ^, 9{x) = — logx. According to Proposition 2.6.1, con- 
dition H{^,w) holds if / e'^dX < 2, so in particular, if w{t) = — |logt. Also, 
by (4.4.2), -0(1) = 1, so that (8.2.11) implies that the median of Xi < 1. Thus 
CoroUary 8.2.4 follows from Theorem 8.2.3. 

Corollary 8.2.5. Assume that (2.6.5) holds for a certain number L. Then, for 
some constant K depending on ^ only, if for allt>l we have 

(8.2.13) P{X^>t)<^\e{m)\ 

then (8.2.9), (8.2.10) hold (for a constant K depending on ^ only). 

Proof. We simply have to find a function w that satisfies (8.2.8) and such that 
condition H{^, w) holds. It follows from Proposition 2.6.3 that if we take R large 
enough (i? can actually be taken depending on L and ^ d\ only) then the 
function w such that 

V6>c |{^>6}| = i|mi 
satisfies condition H(^,w). Now, if we take K = R in (8.2.13), then 



\{w>m}\ = -5\^\m)\>nxi<t), 
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SO that w{P{Xi < t)) > ip{t) since w is non-decreasing. □ 

We now explain why CoroUary 8.2.5 is sharp. Consider the case where consists 
of the single element a = (ct^), where = 1/\/N. Consider ijj such that ip{x) = 
if X < 1 and ip{x) = 2x - 1 for x > 1. Then, for u = Vn , '^j^{u) > 7V/4 by (8.2.1). 
Consider a r.v. Xi e (0, A^}, with 

P{X,^N)^p=:^\^'{2N-l)\. 

Under condition (2.6.5), we have lim x^'{x) = 0, and it is not a restriction to 

X — >oo 

assume Np < 1/2. Thus the median of Z = N~^f'^ Si<7V"^» zero. 
Now 



P{Z >u) = P 



( 1 \ 

-^y^ Xi>u\ 



N 

Np=-e{2N-l) 



and the bound ^ (^) of (8.2.10) is indeed reasonably good, as x^'{x) is of order 
^{x) for many choices of ^. 

8.3. First time passage in percolation 

Consider a graph (F, E) where V is the set of vertices, E the set of edges. Assume 
that we have a family {Xg,)eeE of positive r.v. distributed liked a given r.v. X. (Xg 
represents the passage time through edge e.) Consider a family S of sets of edges; 
and for S E S, consider Xs = J2 -^e- In the case where is a path, i.e., consists 

e€S 

of the edges Cy^^y^, Cy^y^, . . . , e^,^._^^„j. linking vertices vi, . . . , v^, Xs represents the 

"passage time through S" . Let us set Z'^ = inf Xs- Let us set r = sup cards'. 

ses ses 

Denote M a median of Zg. The following is a consequence of (8.2.1) and Corollary 
(8.2.4). 

Proposition 8.3. There exists a universal constant K such that if E exp K~^X < 
2, we have 



(8.3.1) Vu>0, P{\Zs - M\>u) <4exp{ -^inm(—,u 



K \ r 



Consider the case where V = 7?, E consists of the edges that link any two 
adjacent vertices. Denote by S the sets of self-avoiding paths linking the origin to 
the point (0, n); and by S{C) the subset of S consisting of paths of length < Cn. H. 
Kesten [Kl] proved that if P{X = 0) < ^, then, for some constant C independent 
of n, we have 

P{Z's = Z's^a))>^-Ce-^^'' . 
It then follows from (8.3.1) that for some constant C" independent of n, we have 

ft f vP' \ 

(8.3.2) " ^ ^ ^(1^5 -M\>u)< 5exp ( J • 
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This improves recent results of H. Kesten [K2], based on the use of martingales, 
who proves (8.3.2) with an exponent ujC \/n. It should, however, be pointed out 
that the reason why martingales allow some success on this problem is because 
we consider only sums of the type ^cteXe for very special families a. = (cte)- 
Martingales are apparently powerless to approach Corollary 8.2.5. 

It is pointed out in the literature that (in the case y = Z2) (8.3.2) apparently 
does not give the correct rate. In view of Corollary 8.2.5, the obvious approach to 
improve (8.3.2) would be to show that Zs is very close to Zjr, where the family 
of sequences {ae)eeE satisfies a = sup ||ct||2 ^ n. There is an obvious candidate 

for JF. Indeed, consider the family JF', defined as follows: JF', seen as a subset of 
(R+)^, is the convex hull of the family of points as given by as{e) = 1 i{ e E S 
and as{e) = if e ^ 5', for all S e S. Then, obviously, Z'^ = Z'-p,. Then consider 
the family J-'{cr) of sequences {ae)e€V of T' for which ||q;||2 < ct- Then Z'^ < Z'-^f^^y 
Thus if one could show that for some a = o(n), and still have ^^(g.) — ^'s + o(yn), 
with probability 1 — o(n~^), one would obtain that the likely fluctuations of Z^ 
from M are o(y^). Roughly speaking, this means that the shortest passage time 
from (0, 0) to (0, n) is (within o(i/n)) obtained through a number of rather disjoint 
paths. Proving such a statement is apparently a long range program in Percolation 
theory. 
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9. Chromatic Number of Random Graphs 

The use of martingales has aUowed several important progresses in the under- 
standing of the chromatic number of random graphs. Use of martingales does 
require ingenuity. This chapter will demonstrate that Theorem 4.1.1 achieves some- 
what better results than martingales in a completely straightforward manner. 

For simplicity we call a graph G with vertice set V = {1, . . . ,n} a subset of 
£^0 = {{hj) ; i < j}- If ihj) belongs to G, we say that i,j are linked by an edge. 

A subset / of F is called independent if no two points of / are linked by an 
edge (the word independent there should not be confused with its probabilistic 
meaning). The chromatic number x(G', A) of a subset A of is the smallest number 
of independent sets that can cover A; that is, the vertices of A can be given A) 
colors so that no two points with the same color are linked by an edge. We set 



Given p, < p < 1, the random graph G = G{n,p) is defined by putting each 
possible edge in G with probability p, independently of what is done for the 
other edges. 

The chromatic number is remarkably concentrated, as the following shows. 

Theorem 9.1. Consider k & N and t > 0. Then there exists an integer a such 
that 

P{x{G{n,p),m) e [a-k,a]) 
(9.1) > 1 - 2e-*'/^ - P(sup{x(G'(n,p), F) ; F C V , cardF < tyfm} > k) . 

Comments. 1) The last term is always zero for k > t^Jm. But when p = 
(a > 0), it is still small for smaller values of k. See [S-S], [A-S, p. 88]. 

2) Another version of this Theorem could be proved, in the spirit of Theorem 
7.1.3, concerning the concentration property of the number 



3) With a bit of care, we can replace m by m — 1 in the right hand side of (9.1), 
and improve the coefficient 1/8. 

Proof. We set 

b = P(svLp{xiG(n,p), F) ; cardF < t^/m} > k) . 
We then define a as the largest integer for which 



x{G, m) = inf{x(G, A) ; card^ = m} . 



max{cardF ; x{G{n,p), F) < m} . 



(9.2) 



PixiGin,p),m)>a)>e-' '^ + h. 



Thus 



(9.3) 



P(x(G(n,p),m) > a) <e 
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In order to apply Theorem 4.1.1, we must represent the underlying probability 
space as a product space. The first idea that comes to mind would be to use {0, 1}^° ; 
this is not a good choice. For 2 < j < n, set = {0, 1}-'"^. Set ft' = Yl ^j- 

2<j<n 

We write u E as {ujj)j<n, where Uj — {i^i,j)i<j-i G Oj. To u> we associate the 
graph G{(jij) such that, for i < j, e G(u) if and only if ojij = 1. The only 

property of G{n,p) we need is that it is distributed as G{u!) for a certain product 
measure P on J| ^Ij. 

j<n 

Define A <Z fl' as the set of uj for which 

xiGiuj), m)>a; sup{x(G'(a;), F): cardF < ty/m} < k . 

Thus by (9.2) we have P{A) > e~* Combining Theorem 4.1.1 and Lemma 4.1.2, 
we see that P{B) > 1 — where we have set 



(9.4) S = {a; ; V(aj)2<,-<n , 3a;' e ^ ; ^ < \ Yl " 



To finish the proof, it suffices to show that 
(9.5) UJ e B ^x{G{uj),m)>a-k. 



So, consider uj e B, and set r = x(G(a;),m). Consider a subset F oi V, of 
cardinal m, such that x{G{<^)iP) = ^- We use (9.3) with aj = 1 if j e F and 
zero otherwise. Thus there is uj E A such that if J = {j e F ; ujj ^ uj'^}, then 
card J < t\/m. But obviously, 

x{G{uj'),F\J) = x{G{uj\F\I)<T 

and thus 

a<x{G{uj'),F)<r + xi.G{uj')J) 

<r + k. □ 



In order to obtain an upper bound for xg^ the most obvious approach is the 
"greedy" one: one chooses an independent set Wi of maximal sizes, and remove 
its vertices and all edges adjacent. One is then left with a graph on fewer vertices, 
and one iterates the process until exhaustion. To make this approach work one 
needs a competent bound on the probability that a random graph contains at least 
one independent set of size r. Such bounds were first obtained by B. BoUobas [B], 
using martingales. A recent powerful correlation inequality of Janson [J] is both 
simpler and more powerful than the martingale approach (compare [A-S] p. 87 and 
p. 148). It is of some interest to note that Theorem 4.1.1 does as well as Janson's 
inequality. We fix an integer r. For e = G Eq, we denote by N{G,e) the 

number of independent sets of size r that contain 
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Proposition 9.2. Consider a number u, and assume that 

(9.4) pin lY,N{G{n,p),eY< ^ iV(G(n,j9), e) >^. 

\ Y eeEo eeEo j 

Then 

( 

P{G{n,p) contains no independent set of size r) < 2exp I — 



r2(r — 1)2 



Proof. We set O = {0, 1}, provided with ttie probability that gives weight p to 1 
(and 1 — p to 0). Consider the product probability P on 0^°. For x G (a;e)ee£o ^® 
define G{x) by e{i,j) e G{x) if and only if = 1- The graph G{x) is distributed 
like G{n,p). 

Consider the set A C , given by 

A = {y; G{y) contains no independent set of size r} . 

Consider to = 2iyiog(2/P(74)). If we combine (9.4), Theorem 4.1.1 and Lemma 
4.1.2, we see that there exists x such that 



(9.5) u NiGix), e)2 < J] N{G{x),e) 

\j eeEo eeEo 

with the property that 



V(ae)eeSo , 3|/ e A , Ycte<to 

XeT^Ve y eeEo 

In particular, there exists y E A, such that if 

C ^ {e e Eq , Xe ^ ye} 

we have 



5]iV(G(x),e)<to /$^iV(G(x),e)2 

eec y eeEo 

(9.6) <^ 5^ Ar(G(x),e) 

where the last inequality follows from (9.5). The total number N of independent 
sets of G{x) of size r is 

(9.7) ^=(^^^^) ' 5^iV(G'(a;),e). 
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We must have 

N<Y,N{G{x),e) 
eec 

for otherwise there would be an independent set of size r of G{x) that would contain 
no edge of C, and thus would be an independent set of G{y), which is impossible. 
Combining with (9.6), (9.7), we get to > ri^li) , so that 

P(^)<2exp--^^— ^. □ 

In order to take advantage of Proposition 9.2, one must find competent (= large) 
values of u for which (9.4) holds. For example, one can take u — Ui/u2, where 

(9.8) P( 5^iV((G,p),e)2<«n >^ 

\eeEo / 

(9.9) P( J]Ar((G',p),e)>^i) >^ 

\eeEo J 

We then find values of U2 (resp. ui) using Chebyshev inequality (resp. the second 
moment method). Not surprisingly that leads to unpleasant computations (as seems 
unavoidable in this topic). These are better not reproduced here, and left to the 
specialist that wants to evaluate the strength of Proposition 9.2. 
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10. The Assignment Problem 

Consider a number A^, and two disjoint sets /, J of cardinal N . An assignment 
is a one to one map r from / to J. Consider a matrix a — {aij)i^ij^j, such that 
ttij represents the cost of assigning j to i. The cost of the assignment r is ^ cii,T{i) 

and the problem is to find the assignment of minimal cost. 

Assume now that the costs aij are taken equal to Xij, where the r.v. {Xij)i^ij^j 
are independent uniformly distributed over [0, 1]. Consider the r.v. 



Ljv = inf < ; r assignment 

I iei 



It is a remarkable fact [W] that E{Ln) is bounded independently of Ln- (Actually 
E{Ln) < 2 [Ka].) 

In this section we try to bound the fluctuations of Ljv; the challenge is that the 
average value of L^r is of the same order as the average value of the costs Xij, and 
that N'^ of these costs are involved. 

We will first show that we can replace the costs Xij by Yij = min^Xij, v) for v 
of order A^~^(log A^)^; then we will appeal to Theorem 4.1.1. 

A digraph D will be a subset of / x J. (If (z, j) G D, we think as i,j being linked 
by an edge.) The digraphs of use will mostly consist of those couples for which 
Xij is small. Consider a digraph D, and S C I. Se set 

D{S) = {jeJ;3ieS,{i,j)eD}. 

We will say that a digraph D is a-expanding (a > 2) if the following occurs, for all 
subsets S of /: 

N f N 

(10.1) CQxdS < 17 ^ cardi:>(5) > min acard^, 



N 1 
(10.2) cQxdS > =^ cardi:>(5) > N - -{N - caxdS) 



2 ^ ' - \ '2 

N 1 
-^CQxdD(S) >N--\ 
2 a 

Our first lemma mimics an argument of Steele and Karp [S-K] . 



Lemma 10.1. Consider an a-expanding digraph D and an integer m such that 
OL^ > N/2. Consider a one to one map r from I to J . Then, given any i E I, we 
can find n < 2m and disjoint points ii — i,i2, . . . , in+i — i such that for 1 < £ < n, 
we have (z£,r(z^+i)) e D. 

Proof. We fix z G /. Consider the set Sp of points of ip E I that have the property 
that, we can find i2,...,ip in /, for which (i^, r(i^-|-i)) G D for 1 < i < p. We 
observe that, obviously, Sp^i D T~^{D{Sp)). Since we can assume without loss of 
generality that a'^~^ < N/2, we see from (10.1) and induction that for p < m, we 
have cardSp > a^'^ . Then (10.1) shows that cardS'^+i > N/2. Then (10.2) shows 
that for p > 1, TV - coxdSm+p+i < a-PN/2. Thus N - card52m+i < or'^N/2 < 1 
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which means S2m+i — I- Thus i e S2m+i- Consider then the smaUest n for which 

i e Sn-\-\; thus n < 2m. Then one can find ii = i, ^2, is, . . . , in+i = i such that, for 
1 < £ < n we have {ii,T{ii+i)) e D. The minimahty of n imphes that the points 

ii are all disjoint. □ 

Consider tt > 0, and consider the digraph given by e ^ Xij < 

2uN-'^logN. 

Corollary 10.2. Assume that the digraph is a-expanding, and consider an 
integer m such that ct'" > N/2. Then for an optimal assignment t we have .^(j) < 
4muN-^ log TV for all i < N. 

Proof. Consider any i E I, and consider i = ii, . . . ,in+i = i as in Lemma 10.1, 
used for D — D^. Define a{ii) = T{i(,-\-i) for 1 < £ < n, and (j(i') = T(i') if 
i' ^ {zi, . . . , i^}. Since r is optimal, we have 

i'<N i'<N 

SO that 

^i,r(i)< Xi^,aM<2nuN-HogN. □ 

l<£<n 

It remains to do computations. 

Proposition 10.3. For some constant K , and allu > K, ulogN < N, the random 
digraph is u log N- expanding with probability > 1 — N~t< . 

Proof. We explain why (10.1) is satisfied with probability > 1 — N~'^/^ . The case 
of (10.2) is similar and is left to the reader. For simplicity, we set 9 = uN~^ logN. 

Consider a subset S of {1,...,A^} , and set s — card^. For j E J, we have 

P{j ^ D{S)) = (^1 - = (1 _ 2^)^ < exp(-2^.) 

and thus 

P{j e D{S)) > 1 - exp(-2^s) . 

We observe that 

< a; < 1 =^ 1 - e-^ > (1 - e-^)x . 

Thus, if we assume 

(10.3) s9<^ 
we have 

(10.4) P{j G D{S)) > -isO 
where we have set 7 = 2(1 — e~^) > 1. 

Consider 7' = (1 + 7)/2. We claim that, under (10.3) we have 

(10.5) P{cQxdD{S) < ysON) < exp (^-^) • 

This follows from (10.4) and the following general fact: 
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Lemma 10.4. Consider independent events {Ai)i<N with P{Ai) — p, and con- 
sider 5 < 1. Then, the probability than less that SpN events occur is at most 
exp{—Np/K{S)), where K{S) depends on S only. 

Proof. Set Yi — Ij^., so that 

Eexp{-XYi) = l-p(l- e-^) < exp(-p(l - e"^)) . 

Thus 

Eexp j -A ^ I < exp(-7Vp(l' - e"^)) . 

\ i<N J 

By Chebyshev inequahty we get 

P j ^ < 6pN I < exp Np{X6 - (1 - e"^)) 

\i<N J 

SO the result follows by taking A small enough that XS-{1- e-^) < 0. □ 
The number of subsets 5 of / of cardinal s is at most N^. For u> K, we have 

f / f seN\ 



and 



^exp(-£^)<iV-»/-. 



Thus, it follows that with probability > 1 — AT for all subsets S of I such that 

s = cards' satisfies 6s < 1/2, we have caTdD(S) > YdNs. Equivalently, we have 

N 

(10.6) u log iVcardS c&TdD{S) > iu log iVcardS . 

To complete the proof that (10.1) holds for a — ulogN, it suffices to show that 
cardL>(5') > N/2 whenever cccardS > N/2. This follows by applying (10.11) to a 
subset S' of S for which cardS" satisfies acardS" < N/2 and is as large as possible. 
□ 

We can now prove the main result. 
Theorem 10.5. Denote by M a median of Ln. Then (for N >3), 

(10.7) t<./I^^p(\LM-M\>^^^^) <2exp(-t2) 

V VN log log A' / 

logiV 

iVlogt^ 



[\Ln 


-M\ 


(\Ln 


-M\ 



(10.8) t > Vlog N^P{\Ln-M\> ^t}""^^ ) < 2 exp(-t^) . 



Proof. Step 1. Consider u < N/{2logN), a = ?ilogA^ and the smallest m such 
that > N/2. Set v = AmuN'^logN, and Yij = mm{Xij,v). Consider the 
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r.v. defined as Ln but using the costs Yij rather that Xij. It follows from 
Corollary 10.2 that = Ljv whenever is a-expanding, so that by Proposition 
10.3 

(10.9) P{Ln = L]V) > 1 - iV"^/-^ . 



Step 2. When AT""/^ < 1/2, it follows from (10.9) that M is also a median of 
Ljy. It then follows from (8.1.1) (and scaling) that for all w > 0, 

P(|L]V-M|>«;)<2exp(^-^) 

and combining with (10.9) we get 

P{\Ln -M\>w)<2 exp (^-^^) + N-'^^'' ■ 



Step 3. We choose the parameters. We take w = 3y/Ntv. If < log A?", we 
take u = K; if t'^ > log TV, we take u = Kt^/logN. 

Theorem 10.6 follows easily. □ 

Remark. A simple computation using Theorem 10.6 shows that the standard devi- 
ation of Liv is not more than i^(log A^)^/\/iVloglog A^. 
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11. Geometric Probability 

11.1. Irregularities of the Poisson Point Process 

In this Chapter we will consider points Xi , . . . , X^r that are independent 
uniformly distributed in [0, 1]'^, where, except on Section 13.5, (i = 2, and we will 
study certain functionals L{Xi^ . . . , Xn) of this configuration Xi, . . . , X^ (that is 
L will depend only on {Xi, . . . , X^} rather than on the order in which the points 
are taken). 

One would like to think that the sample Xi, . . . , Xn is rather uniform on [0, 1]^; 
say, that it meets every subsquare of side K/ \fN. This is not the case; there are 
empty squares of side of order {N~^ logN)^/^ (an empty square will informally be 
called a hole). More importantly, in exceptional situations there are larger empty 
squares. Several of the functionals we will study have the property that, if one 
delete or add a point to a finite set F, the amount by which L{F) can vary depends 
whether F has a "large" hole close to x. Thereby the first task is to study the size 
and number of holes. 

It is not convenient to work with the sample Xi, . . .Xn. The difficulty is that 
what happens say, in the left half of [0, 1]^ (for example, there is an excess of points 
here) affects what happens in the right half (there must then be a deficit of points 
there). Rather, one will work with a Poisson point process of constant intensity 
ji. This process generates a random subset 11 (= 11^) of [0, 1]^ with the following 
properties: 

(ll.l.ljf A and B are disjoint (Borel) subsets of [0, 1]^, 11 n ^ and 11 n 5 are 
independent. 

(11.1.2jf ^ is a (Borel) subset of [0, 1]^, the r.v. card(nn^) is Poisson of parameter 
jJi\A\^ where |^| denotes the area of A. 

Let us recall that a r.v. Y is Poisson of parameter A if P{Y = k) = e~^\^ /k\ 
for /c > 0. Thus 

E{e^^) = J2 e'^^e"^^ = exp(A(e" - 1)) . 

fe>0 

For the convenience of the reader, we recall some simple facts. 

Lemma 11.1.1. If a r.v. Y satisfies 

(11.1.3) £;(e^^) < exp(A(e" - 1)) 
for u >0, then for 

(11.1.4) PCY >t)< exp i-tlog \ 

Proof. One can assume t>\. Write 

P{Y >t)< e-*'^E(e"^) , 
use (11.1.3) and take u = log(t/A). □ 
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Lemma 11.1.2. If the r.v. Y is Poisson of parameter X, then 

P^y<^)<exp-^. 

Proof. Write, for all w > 

p(y<^^ < exp ^wEe-"^ = exp + A(e-" - 1) 
and take u — 2. □ 

For k > 1, we denote by Ck the family of the 2"^^ "dyadic squares" of side 2~''. 
So the vertices of these squares are of the type (^i2~'^, ^22"'^), < £1,^2 < S"'^, 

h,h e N. 

For C E Ck, v^e set 

= 1 if card(C n n) < //2-2'=-3 
Zc = otherwise . 
From (11.1.2) and Lemma 11.1.2 follow that 5^ = P{Zc = 1) satisfies 

(11.1.5) 5k < exp(-//2-^'^-^) . 
Now, for tt > 0, 

(11.1.6) Ee'^^^ = 1-Sk + Ske"" 

= l + 4(e"-l)<exp4(e"-l). 
By (11.1.1) the variables {Zc)c€Ck ^re independent; so that, by (11.1.6) 

Ee^'^cec, < exp 22^=4(6" - 1) 
and by Lemma 11.1.1 we have 



(11.1.7) p('j^Z„>.Uexp(-«log-^) 



\CeCk 

Observe that rik — J2 is simply the number of squares of Ck that contain 

ceCfe 

no more than 1/8 of the expected number of points of 11 they should contain. 
Combining (11.1.5) and (11.1.7) we see that 

{U.l.Sfk, Pink > 2e^22'=exp(-//2-2'=-i)) < exp(-2e^22'= exp(-//2-2'=-i)) . 

We now fix a number t, and we study how the number can be controlled if 
one rules out an exceptional set of probability < e~* . We assume t > 1, > 4. 

We denote by ki the largest integer such that 
(11.1.9) e222'=i exp(-//2-2'=i-i) < t"^ . 

Thus, ki >0 and for k > ki we have 

,2o2fc„,_/ ,,o-2A;-l\ ^ J.2 



We now observe that if a > 1, we have ^ exp(— 2^^a) < 2exp(— a), so that, 

£>0 

combining with (11.1.8) 

(11.1.10) P{yk >ki,nk< 2e^2^^ exp(-/x2-2'=-i)) > 1 - 26-^*' . 
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Lemma 11.1.3. If t < ^/]ilK, we have 

e ^ 1 



2 



Proof. It suffices to show that Sk^-i < / jJ,. Now, by (11.1.5) and (11.1.9), 

v/5^<exp(-^2-^'=0<(^)' 

Thus it suffices to show that 22^=1 > t^ije^, i.e. 2^(^1+1) > At^/e^. The function 
f{x) = e^x exp(— |u/2x) is increasing for x > 0. Thereby, since /(2^*^^i+^)) > by 
definition of /ci, it suffices to show that /(4ty//I/e^) < t'^, which is equivalent to 
exp(-ae2/2) < l/16a for a = y/Ji/At. □ 

We now apply (11.1.7), taking k = ki — 1 and v = e2'^^^t^ / ji. We observe that 
by Lemma 11.1.3 and (11.1.5) we have, for t < ^JJi/K^ 

so that 

(11.1.11) P > 1 < exp(-2t2) . 

We now go back to the sample Xi, . . and state our conclusions. 



Proposition 11.1.4. Consider t < yN/K. Denote by ko the largest integer for 
which 2^^° < A'". There exists an integer ki < ko such that 

(11.1.12) :^ - 2'^'°"''^ ^ ^log ^ 

_ ,2 

and such that with probability > 1 — Ke we have the following properties, where 
rrik denotes the number of squares C of such that 

(11.1.13) card{C n {Xi, . . . , Xn}) < N2-'^^-^ . 
For each ki < k < ko, we have 

(11.1.14) ruk < K2^^ e^^{-N2-^^-^) 
and we have 

22fci^2 

(11.1.15) mfe,_i<i^— 
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Proof. Step 1. Consider the process 11 — 11^, for /x = It follows from (11.1.4) 
that with probability > 1 — exp(— A^/i^), we have cardll < N. It is obvious that, 
conditionally on the event {cardll < N}, the number of squares C of Ck for which 

card(C fl 11) < N2~'^^~^ — n/8 stochastically dominates the number m^. Thus it 
suffices to prove (11.1.13) to (11.1.15) for rather than m^, since, as we consider 
only t < \/N /K, the term exp(— A^/i^) is swallowed by the term K ex.\){—t^). 

Step 2. We define ki as in (11.1.9). We observe that, since 1 < N2~'^''" < 4 
and t < \fN/K, we can assume < 6^2^^^° exp(-//2~^''o~^), so that ki < ko. By 
(11.1.9) and definition of ki we have 

exp//2-2('=i+i)-i < < — 

so that /x2-2'=i < Klogf , and thus 22^=0-^1) < Klogg. By (11.1.9), 

2o2fci o2A;o TVf / ATX 

exp(,2— ) > > > ^ (i^log^) 

and this finishes the proof of (11.1.12). 

Step 3. By (11.1.10), with probability > 1 — 2e~2*^^ for each k > ki we have 

(11.1.16) Uk < 2e222fc exp(-iV2-2'=-4) . 

Now we observe that n/o^ < n/c^+i. This is obvious, since, if C e C^j, one of the 
4 squares of Cfc^+i contained in C must contain at most card(n fl C)/4 points. 
Thereby, by (11.1.16), for each k > ki we must have 

nfc <8e222'=exp(-iV2-2'=-6). 
Also, (11.1.11) shows that, with probability > 1 — e~2*^^ we have 



(11.1.17) nk,-i < 



N 



The events described above occur simultaneously with probability > 1 — 3e 2*^ . 

□ 

Having studied when and how the sample Xi, . . . ,Xn can have a "deficit" of 
points, we study how it can have excesses of points. While Proposition 11.1.4 is 
central to this chapter, the following result will be used only in Section 11.4. 

Proposition 11.1.5. Consider the integer ko of Proposition 11.1. 4, and consider 
^2 ^ ^0- For k2 < k < kg consider a number r^ such that 2"^^ > > 2'^^t^ /N . 
Then, with probability > 1 — Ke~^ , the following occurs 

(ll.l.lS^ven k2 < k < ko, and given a set S G Ck with cardS < r^, then 

cardH < N ; e UiC: C e S}} < KN2-^''rk + rk log . 
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Proof. For a subset U of [0, 1]^, we have 
(11.1.19) P(card{z < N ; Xi e U} > u) < exp { -ulog 



u 



eN\U\ 

This follows from (11.1.3) and (the argument of) (11.1.6). 

For a subset S of Cfc, denote Us the union of the elements oi S. It suffices to 
consider the sets S with cardS = rfc. For these we get from (11.1.19) 

P(card{i < iV ; e Us] >u)< exp ( -ulog 



There are at most (^^^ ) < exp(rfe log(e2^'^/rA;)) choices for S. We take 

« = rfe log(e22Vrfe) + e^N2-''\k . 

Thus we see that 

UJ^"H""^''^iV2^J - U J '"^^"'"^ - (^^^ ^- 

Since (^^)^ < 2~^^ for 1 < a; < 2^^", we see that (11.1.18) occurs with probability 
at least 1 - Ke~*'. □ 



11.2. The Traveling Salesman Problem 

The Traveling Salesman Problem (TSP) requires, given points xi, . . . , xn in 
the plane, to find the shortest tour through these points; in other words, to minimize 

N-l 

\\Xa{N) - Xcr{l)\\ + ^ Ika(i) " 
i=l 

over all permutations a e Sn- The charm of the TSP is that it is the archetype of 
untractable question. In this section, we denote by L{F) the length of the shortest 
through F, and we study the r.v. Ln — L{Xi, . . . , Xn) where Xi, . . . , Xn are 
independent uniformly distributed over [0, 1]^. 

While the TSP is usually very hard, somewhat surprisingly, it turns out that 
as far as the concentration of Ljv is concerned, it is the easiest problem we will 
consider. The reason for this is its good regularity properties. The only fact we 
will use about the TSP is as follows. 

Lemma 11.2.1. Consider F C [0, 1]^, C G Ck, G d C , and assume that there is a 
point of F within distance 2"^^+^ of C . Then 

(11.2.1) L(F) < L[F UG)< L{F) + K2-^yJcardG . 



Proof. An essential property of the TSP is its monotonicity: L[F) < L{F[J{x}), as 
is seen by bypassing a; in a tour through FU{x}. This implies the left side inequality 
in (11.2.1). To prove the right hand side inequality, one first uses the (well known, 
elementary) fact that there is a tour through G of length < K2~''y/caxdG, and one 
connects this tour to a tour of F. 
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Theorem 11.2.2. Assume that the functional L satisfies the regularity condition 
of Lemma 11.2.1. Then, if Xi, . . . , are independent uniformly distributed over 
[0, 1]^ for each t > the r.v. Ln = L{Xi, . . .,Xn) satisfies P{\Ln - M\ > t) < 
Ke~* where M is a median of L^. 

Since the TSP is the simplest case we will consider, we will give the shortest 
possible proof, which is considerably simpler than the original proof. The idea of 
this proof is, however, a bit tricky; a more straightforward, but somewhat longer 
proof will be given in Section 11.3. 

The basic idea of the whole chapter is as follows: consider Q, = [0, 1]^, and the 
subset A{a) of that consists of the TV-tuples yi, ■ • • , yN for which L{yi, . . . , y]\f) < 
a. When a = M is the median of L, Proposition 2.1.1 shows that, except for a 
set of probability 2e~* , given Xi, . . . , Xn, we can find (j/i, . . . , j/at) G ^{o,) such 
that card J < Kt\/N , where J = {i < N ; Xi ^ yi} . Thus we have a tour through 
{Xi ; i ^ J} of length < M. The points Xi., i & J should be in average at distance 
< K/\fN of the set {Xi ; z ^ J}; so each of them can be inserted in the tour by 
lengthening the tour of at most K/\/N; for a total lengthening < Kt. This would 
prove that P{Ln > M + Kt) < e~* . The problem with this argument is that the 
points Xi, i & J could be precisely chosen among those which are much further 
than K/ \/N from their closest neighbor. So we have to find a way to show that 
this does not happen, or at least that the effect of this phenomenon does not affect 
the final result. The idea of this section is to give appropriate weights a{Xi) to 
each point Xi (the more isolated the point is, the higher its weight) and then to 
use Theorem 4.1.1, to minimize the influence of points with large weights. 

For X G [0, 1]^, throughout this chapter, Ck{x) denotes the square C & Ck con- 
taining X. Throughout this section, we will set F = [Xi, . . . ,Xn}, 

nk = {C eCk-, card(F n C) < N2-^''-^} , 

and ruk = cardTifc. 

We fix t < /K, and we recall the integers /cq, ki of Proposition 11.1.4. 
For X e [0, 1]^, we define 
a(x) = sup{2-'= ■,ki<k<ko; card({Xi, . . . , Xn} n Ck{x)}) < N2-^''-^} 
when the set on the right is non-empty; and we set a{x) = 2"^^° otherwise. 

Proposition 11.2.3. With probability > 1 — K exp{—t'^), we have 
(11.2.3) J^a^(Xi)<K. 

i<N 

Proof. It should be obvious that 

"^a'^iXi) <K+ J2 2-2'=card(F n U{C ; C e Tifc}) 

i<N ki<k<ko 

<K+ J2 2"^'' X iV2-2'=-^card7ife . 

ki<k<ko 
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By Proposition 11.1.4, with probability > 1 — Ke~^ , for all ki < k < ko, we have 

2fe„,„/ Aro-2fc-6> 



ruk = CQxdUk < K2^'=exp(-Ar2-^'=-'') 



The result then follows from the elementary fact that ^2 exp(— A'^2 ) < 

fc<feo 

K/N. □ 

Proposition 11.2.4. In order to prove Theorem 11.2.2, it suffices to prove Propo- 
sition 11.2.5 below. 

Proposition 11.2.5. Consider Xi, ... ,Xn, and a subset J of {1, . . . , N}. Assume 
that 

(11.2.4) ^a(Xi)<Kt 

(11.2.5) cardHk.-i < ^ • 
Then 

(11.2.6) L(Xi, . . . , Xat) < L{{X, ; « e J}) + K't , 
where K' depends on the constants in (11.2.4) o-i^d (11.2.5) only. 

Proof of Proposition 11.2.4- To prove Theorem 11.2.1, since Ljv < K\/N, it suffices 
to consider the case t < y/N/K. We fix such a t, and we consider a such that 
P{Ln < a) > e~* . We will prove that 

(11.2.7) P(Ln >a + Kt)< Ke'*^ 

and this clearly implies the result. The condition P{Ln < a) > e~* means 
P{A{a)) > e~* (where P denotes now the product measure on Q^). If we combine 
Lemma 4.1.2 and Theorem 4.1.1, we see that with probability > 1 — e~* , the set 
{Xi, . . . , Xn} has the property that we can find (j/i, . . . , j/jv) £ ^(f*) for which 



i^J ]] i<N 

where J = {i < N ; X^ = j/j}. Now, by Proposition 11.2.3 and Proposition 11.1.4 
we can moreover assume, with probability > 1 - Ke'^ that J2 a{Xif < K and 

i<N 

that (11.2.5) holds. By Proposition 11.2.5, we then have 

L(Xi, ...,Xn)< L{{yi ■ieJ}) + Kt<a + Kt. □ 



Proof of Proposition 11.2.5. We set F' = {Xj ; z e J}, G = {Xj ; z ^ J}. We have 
to incorporate the points of G into a tour through F' without lengthening too much 
the tour. 
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Step 1. For < /c < /cq, we denote by [/^ the collection of those C E Ck that 
satisfy C fl F' = 0; we set Ul.^ = Cfeg, and, for < A; < ko, we denote by Uk the 
collection of those C e Uj^ that are not included in any C e Uk-i- Thus, if C e Uk, 
its distance to F' is < 2"'=+^. 

By repeated applications of Lemma 11.2.1, we see that 

L{Xi,...,XN)<L{{Xi;ieJ}) + K J2 2" Vcard(G n C) . 

0<k<ko CeUk 

Thereby, it suffices to show that this double sum is < Kt. 
Step 2. We consider three types of terms: 
Type 1: card(G' n C) > iV2-2'=-7. 

In that case, since a{Xi) > N~^^'^, we have 

(11.2.7^-Vcard(G'nC) < -^card(G n C) < K eCOG}. 



Type 2: /c > /ci, card(G' n C) < N2-'^^-\ 

In that case, the definition of Q:(Xj) shows that a{X^ > for Xi e C. Thus 
2-Vcard(GnC) < 2-'=card(G n C) < ; e C n G} . 

We observe that the total contribution of the terms of Types 1 and 2 is < Kt by 
(11.2.4), since the union of the sets Uk are disjoint by construction. 

Type 3: k<ki, card(G n G) < N2-^^-^ . 

Step 3. We control the contribution of the terms of Type 3. We denote by Vk 
the union of the sets C E Uk for which card(G fl G) < N2~'^^~'^ . Denoting by \V\ 
the area of V, the key observation is that, under (11.2.5) we have 



(11.2.8) 



k<ki 



< ^ 



The reason is simply that if G G Cfc satisfies card(G n G) < N2 , when 
G e Uk, card(G n G) = card(F n G) < N2-'^^-\ so that, among the 2^^^^'^'^'^ 
squares C of Ck^-i that arc contained in G, at least half of them must satisfy 
card(G'nF) < Ar2-2(fci-i)-6^ so belong to Tifci-i- Thereby the area of J ^k can 

k<ki 

be at most twice the area of the union of Hki-i- 

There are 2^'^|Va;| sets G of Ck included in 14. Thus, by Cauchy-Schwarz, we 
have 

J2 2" Vcard(G n G) < 2-'=^card(G n Vk)2^''\Vk\ 

C€Ck,CCVk 

= Vcard(Gn 14)1141 . 
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Using Cauchy-Schwarz again, the sum of these terms over k < ki is at most 
V'|T/|card(GU]/) < ^/N\V\ where V= \J Vk- This is less than Kt by (11.2.8).n 

k<ki 

11.3. The Minimum Spanning Tree 

A spanning tree of a finite subset F of is a connected set that is a union of 
segments (caUed edges) each of which joins two points of F. Its length is the sum 
of the lengths of these segments. We denote by L{F) the length of the shortest (= 
minimum) spanning tree of F. An interesting difference with the TSP is that it 
can happen that L{FU {x}) < L{F). This is e.g. the case if F consists of the three 
vertices of an equilateral triangle and x is its center. 

The regularity property of L that we will use is as follows. 

Lemma 11.3.1. Consider C e Ck (k > 1) and a subset F of [0, 1]^. Assume that 
each C e Cfc-i that is within distance 2~^'^^ of C meets F. Consider a subset G 
ofC. Then 

(11.3.1) \L{F U G) - L{F)\ < K2-^\/cardG. 

Proof. Step 1. The inequality 

L{F U G) < L{F) + K2-'=(cardG')^/2 

is proved as in the case of the TSP. The problem is the reverse inequality. 

Consider a minimum spanning tree of F U G. We remove all the edges adjacent 
to G. This breaks the spanning tree in a number of pieces; and we have to add 
edges to connect it again. We will prove 2 facts. 

Fact 1. There is at most 6 cardG pieces; 

Fact 2. Each of the pieces contains a point within distance K2~^ of C. 

Once this is known, we simply take a point in each of these pieces within distance 
K2~^ of C. We build a tour of length < i^2~^(cardG)^/^ through these points to 
reconnect the pieces. 

Step 2. Proof of Fact 1. Consider three points x, a, 6 of F U G, such that the 
segments [x, a], [a;, b] both belong to a minimum spanning tree of F U G. Then we 
must have > a|| for otherwise we could remove the edge [x, a] and replace 

it by [a, 6] to get a shorter spanning tree. Similarly, we have \\a — b\\ > \\x — b\\. 
Thus the angle between the lines xa, xb is at least 7r/3. Thereby the spanning tree 
must contain at most 6 edges adjacent to each point. Thus removing k points and 
the edges adjacent creates at most 6fc connected components. 

Step 3. Proof of Fact 2. Consider a finite set H of [0, 1]^. Consider a, b in H, 
and assume that [a, b] belongs to a minimum spanning tree of H. We show that 
the "lens" 

(11.3.2) La,b = {x ; \\a - x\\ < \\a - b\\ , \\b - x\\ < \\a - b\\} 
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does not meet H. Indeed if we remove [a, h] from the minimum spanning tree, we 
split H into the component Ha containing a and the component Hi, containing h. 
If there existed c e La^h n Ha-, we could remove the edge [a, 6] from the minimum 
spanning tree, and replace it by [c, 6] to get a shorter spanning tree. Similarly, 
La,b r)Hb = $. 

Wc apply the above result to H = FUG. An edge [a, b] from a minimal spanning 
tree of H is such that La^b does not contain a square C in Ck-i within distance 
2-fc+5 q£ because it is assumed that all such squares meet F, hence H. Thus, if 
a e C, then, clearly, ||6 - a\\ < K2~^ . □ 

The main result of this section is as follows. 

Theorem 11.3.2. Assume that the functional L satisfies the regularity condition 
of Lemma 11.3.1. Then, if Xi, . . . , Xn are independent uniformly distributed over 
[0, 1]^, the r.v. Ln = L{Xi, . . . , Xn) satisfies 

yt>0, P{\Ln -M\>t)< Ke"*'/-^ 
where M is a median of Ln- 

One central idea of the approach will be to condition with respect to Xi, . . . X^, 
where m = [A^/2]. The size of the holes of {Xi, . . . ,Xn} are then controlled by 
the sizes of the holes of {Xi, . . . , Xm}, independently of Xm+i, ■ ■ ■ Xn- The main 
part of the proof of Theorem 11.3.2 is to obtain the following statement. We set 

n = [o,i]^ 

Proposition 11.3.3. Consider an integer n with |^ — n| < 1. We write Qi = Q'^, 
Q2 = ri^""; we denote by Pi, P2 the product measures on Qi, Q2 respectively. 
Given < t < \fN /K, there exists a subset Ht of Q,\ such that Pi{Ht) < i^ie~* , 
and that, whenever {xi, . . . , x^) ^ Hf, the r.v. 

L' = L'(X„_|_i, . . . , Xn) = Ln(xi, . . . ,a;„, X^+i, - - - ,Xn) 

defined on Q2 has the following property 

(ll-S-Sff P2{L' <a)> e-*\ P2{L' >b)> e"*', then b-a<Kt- 

First, we prove that Proposition 11.3.3 implies Theorem 11.3.2. To prove that 
theorem, it suffices to prove the following statement: 

If P{Ln <a)> 2e-*'/2, P{Ln >b)> 26'^'/^, then b - a < Kt. 

Consider the set A = {L^v < a} in ft^ . We wiU write = fli x O2 (^i = f^"; 
^2 = and P ~ Pi ® P2. Thus, given uji e fii, we define L' on ^2 by 

L'{uj2) = Ln{uji,uj2)- For u>i G Oi, we write 

A{u}i) = {uj2 e ^2 ; (^i,<^2) e A} . 

Since P{A) > 2e-*^/^ the set 

Ci = {a;i e Qi ; P2(^(a;i)) > e-'"/''} 
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satisfies Pi(Ci) > e-*'/^. Consider C2 = Ci\Ht, so that Pi(C2) > (e-*'/^ - 
Kie"*'). When cji e C2, we have P2(i-' < a) > e-*'/^, so that by (11.3.3) we have 
P2{L' < a + Kt) > 1 — e~* . By Fubini theorem, we get 

(11.3.4) P{Wi) > (1 - e-*')P(C2 X f]2) 

where Wi = {L^ <a + Kt} n (C2 x ^2). 
We observe that (11.3.3) imphes 

P2{L' >b)> e-*' P2{L' >b-Kt)>l- e-*' . 
Thus, we can apply the same argument as above to show that 

(11.3.5) P(W^2) > (l-e-*')P(Oi xL>2) 

where W2 = {Ln >b- Kt} n {Qi x D2), and P2(P'2) > e'*^/^ - Kie-^\ For t 
large enough, 

P((C2 X n2)\Wi) + P{{ni X D2)\W2) < P{C2 X D2) 

so that WinW2^^. □ 

We now start the proof of Proposition 11.3.3. Consider en = [0, i]2, 

and set F' = {xi, . . . Xn}- Denote by the number of squares of Ck that do not 
meet F' . We consider the integers ki, ko of Proposition 11.1.4 (defined using n 
rather than A^). We define Ht as the set of n-tuples {xi, . . . , Xn) for which 

(11.3.6j^or each k, ki < k < ko, we have 



2k-6^ 



(11.3.7) 



K2'^''H 



n 



Thereby, Pi(Ht)>l- Ke'* by Proposition 11.1.4. 

We now fix (xi, . . . ,a;„) such that (11.3.6), (11.3.7) hold and we start the proof 
of (11.3.3). For X e [0, 1]^, we denote by £{x) the smallest integer £ such that there 
is C e Ce, C within distance 2"^+^ of Ce{x), such that F' DC = $. Thus, by 
definition, we observe 

(11.3.8jf i = i{x), any square C e Ce-i that is within distance 2~^+^ of C£{x) 
meets F'. 

We also observe that if y G C^(a;) (x), then £{y) = £{x), so that = {x; £{x) = £} 
is a union of squares of C^. 

Lemma 11.3.4. a) We have, for each ki < k < ko that 
(11.3.9) l^fcl < Kexp(-n2-2'=-6) 
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b) 



e<ki 



Kt^ Ke 

< < 

~ n ~ N 



Proof. Let us denote by the union of the elements of Q that do not meet F' , 
and set Ug = U'f,\ [j U^. It suffices to observe that if a; e V^, then C^(a;) is within 

distance 2~^+^ of f/g, so that \Vi\ < K\Ui\, and the result follows from (11.3.6), 
(11.3.7). □ 

We consider the function g{x) = 2-™*^('=i'^(^)). By (11.1.12), we have 

K / n \ 1/2 K 

(11.3.10) yU<2-.<_(,og^) <-. 

By (11.3.9) and an obvious computation, we have 

(11.3.11) \\g\\2<K/y^. 

To prove (11.3.3), we have to prove that if a, b are such that P2{L' < a) > e^* , 
P2{L' >b)> e~* , then b — a< Kt. We now appeal to Corollary 2.4.5 with u = Kt, 
for the function h{x,y) = g{x) + g{y). Prom (11.3.10), (11.3.11), we see that we 
can find yn+i, • • • , Un, Zn+i, ■ ■ ■ , zn such that 

(11.3.12) L'{yn+i, . . . , i/at) < a ; L'{zn+i, ...,ZN)>b, 
and 

(11.3.13) Y.^g{yi) + g{zi)) < Kt 

where J — {n + 1 < i < N ; yi ^ Zi}. 

Consider the set F that consists of the points xi, . . . ,Xn, as well as the points 
yi, i ^ J. We will prove 

(11.3.14) \L'iyn+u ...,yN)- L{F)\ < Kt . 
The same argument will show that 

\L'{Zn+l,...,ZN)-L{F)\ <Kt 

and this will finish the proof. 

First we observe from (11.3.1) that if Fi D F, and if £{x) > ki, then \L{Fi U 
{x}) - L{F)\ < Kg{x). Thereby, it follows from (11.3.13) that we can add to F 
all the points yi, i ^ J, for which £{yi) > ki, without changing the value of / by 
more than Kt. Denote by G the set of the other points y^. We observe that G is 
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contained in IJ V£. Consider C E Ci, C G V£. By (11.3.1), we have, for any set 

£<ki 

Fi containing F, that 

|L(Fi U (GnC)) -L(Fi)| < K2-\caxdGnCf^^ . 
Thereby it suffices to show that 

^ 2"^ ^ (cardG n C^/^ < Kt . 
e<ki ccVi 

But this is shown as in Step 3 of the proof of Proposition 11.2.5. □ 

11.4. Gabriel Graph and Voronoi Polygons 

Given a subset F of [0, 1]^ its Gabriel graph is the set of edges [a,b] such that 
the closure La,b of the set La,b of (11.3.2) meets F only in a and b. When the set F 
has the property that it does not contain points x, y, z such that — y|| = — 2||, 
(a property that is satisfied with probability one for random sets) this is equivalent 
to saying that F contains the edge [a, b] if and only if La,b does not meet F. In 
that case, the Gabriel graph contains the minimum spanning tree, as is shown in 
the course of the proof of Lemma 11.3.1. As in the case of the MST, at most 6 
edges are adjacent to each point of F. 

We denote by L{F) the length of the Gabriel graph. An interesting feature 
of this functional is that in certain special configurations adding a single point 
creates a big decrease of L{F). A typical such configuration consists of the points 
(0, fc/n), (1, /c/n), < k < n. The Gabriel graph contains all the edges between 
(0,A;/n) and {l,k/n). All these edges will disappear when one adds the middle of 
the unit square to F. The following lemma shows that the previous example is 
close to be the worst possible behavior. 

Lemma 11.4.1. Consider C e Ck, F a subset of [0, 1]^, and assume the following 
{llA.iySvery element C of Ck-i that is within distance of2~^~^^ of C meets F. 
Then if G G C, we have 

(11.4.2) \L(F) - L(F U G)\ < K2-^card{{FiJG)nB{C,K2-^)} 

where B{C,r) denotes the set of points within distance r of C . 

Comment. The difference with Lemma 11.3.1 is that the bound depends now upon 
FUG rather than G alone. 

Proof. As already seen, a point is adjacent to at most 6 edges, and, as in the case 
of the MST, edges adjacent to G have a length < K2~'^ . Thus 

L{F U G) < L{F) + K2-^cQxdG . 

To prove the reverse inequality, we observe that the edges [a, b] that belong to 
the Gabriel graph of F but not to the Gabriel graph of F U G are exactly these for 
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which Lo,b\{(i,&} meets G but not F. Then \\a — h\\ < K2~^, for otherwise there 
would exist C e Ck-i within distance 2"^^+^ of C that would not meet F. This 
implies, since La,b meets G, that a,b E B{C^K2~^). In the Gabriel graph of F, 
there are at most 6-card(FnS(C, K2~^)) edges adjacent to points in -B(C, K2~^), 
so at most that many edges can be removed. □ 

Another natural example of functional that satisfies Lemma 11.4.1 is the total 
length of the Voronoi polygons. If F is a subset of [0, 1]^, and x E F, let us 
define the Voronoi polygon Vx of x as the set of all points y of [0, 1]^ for which 
d{x,y) — d{y,F\{x}). (This name is a bit abusive since when x is close to the 
boundary of [0,1]^ this set is not a polygon). Denote by L{F) the sum of the 
lengths of the Voronoi polygons of all points of F. We sketch a proof that L{F) 
satisfies the condition of Lemma 11.4.1. First, we observe that iiy eVx, there is no 
point of F within distance less than y || of y. Thus, if a; e G, the Voronoi polygon 
of X (with respect to FU G) is under (11.4.1) entirely contained in B{x, K2~^), so 
is of length < K2-^ . Thus L{F U G) < L{F) + K2-^caxdG. To prove the reverse 
inequality, consider a point a belonging to the Voronoi polygon of x e F, with 
respect to F, but not with respect to F U G. Then there is no point of F within 
distance less than — a|| of a, but there is at least a point of G. Under (11.4.1) we 
have a,a; e B{C,K2~^); but the total length of the part of the Voronoi polygons 
of F contained in S(G, K2-^) is easily seen to be < K2-^card(F n S(G, K2-^)). 

Theorem 11.4.2. Consider a functional that satisfies the condition of Lemma 
11. 4-1- Set, as usual, L = = L{Xi, . . . ,Xn), and consider the median M of 
Lm- Then 

(11.4.3) Vt>0, P(|L-M| >t) < Kexp ^--^min(t2,(tViV)2/3) 

In particular, the tails of Ln are subgaussian for values of t up to A^^/^. We 
now sketch, in the case of the Gabriel graph, why, within logarithmic terms, the 
exponent in (11.4.3) is correct for t > N^/^. We give an informal argument, that 
could be made rigorous. For simplicity, let us argue about F(n), where 11 is a 
Poisson point process of intensity N. Consider u < y/N, and let a = u/\fN < 1. 
Denote by k the cardinality of Iln [0, a]^. When k is even, conditionally on k, with 
probability > (l/Kk^)^, the k points of Iln [0, a]^ are such that each of the discs of 
center (ry, 2£a/k), for rj G {0, a}, 1 < i < k/2, and of radius a/Ak^ contains exactly 
one of these points. Then the Gabriel graph of 11 contains the edge from the point 
in the disc of center (0, 2£a/k) to the point in the disc of center (a, 2£a/k), for a 
total length of order ka. Now with overwhelming probability k is of order u'^; so, 
with probability > we get the exceptional configuration described above 

that creates an abnormal length of order t — u^a — u^/\/N. Now u — (t\/iV)^/^, 
and ^ 

{-i^y >exp(-l(tx/iV)2/3logtViv) . 

So this later quantity is a lower bound on the probability that we get an abnormal 
length of order t that will have L exceed the median by t. 

To prove Theorem 11.4.2, we observe that, since iF^vl < KN by (11.4.2) it 
suffices to prove (11.4.3) for t < N/K. We follow the scheme of Section 11.3. It 
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suffices to be able to modify Proposition 11.3.3, so that when t < \fN / K (11.3.3) 
can be replaced by 

{llAA}fP2{L' <a)> 2e-*', P2{L' >b)> 26"*', then 6 - a < K (t + 
Once this is known, as in Section 11.3, we prove that 

P{L <a)> 2e-*'/2 ^ p(^L >b)> 2e-*^/^ imply b - a < K (^t + . 



This implies Theorem 11.4.2 since, if we set w = t + t^/^/N, for u < N/K we have 
t < ^/N /K; moreover, we have > min(w^, (u^/N)'^/^). 

The construction of Ht and the proof of (11.4.4) will parallel the proof of Propo- 
sition 11.3.3. In order to avoid repetition, we will not repeat the entire argument, 
but simply explain the necessary modifications. 

The construction of i^t is modified as follows. We require that for ki — l<k<ko, 
and each subset S of C^, with card^ < rk, then 

(11.4.5) cardii <n;x^e U{C ; C e S}} < KN2-^''rk + rk log 

rk 

where we set rk^-i = 2^'^it^/n and for k > ki we set 

rk = Kt2^'=-^'=o . 

We observe that, using (11.1.12) 

\/N n 



provided K is large enough. It then follows from Proposition 11.1.5 that imposing 
these extra conditions does not change the fact that Pi{Ht) > 1 — Ke~^ . 

We change the definition of the function g{x) to 



g(^x) = ^ ^ofco-max(fci,^(a;))-^4 

Thus 



(11.4.6) ||,|U<_L(2^.-.).<|.(,o,^)%^ 



and, obviously, (11.3.11) still hold. 

Suppose now that we are given a, b with 

P2iL' <a)> 2e-*' , P2iL' > b) > 26"*' . 

Using Proposition 11.1.5 again, we see that we can find a set A C {L' < a}, 
P2{A) > , such that whenever {yn+i, ■ ■ ■ , Un) £ A, we have 
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(11.4.7j*'or each /ci — 1 < /c < fco, for each subset S of Ck, such that card^ < r^, 
then 

cardin + 1 < z < iV ; G U{C; C e S}} < KN2-'^^rk + fog . 

We then consider, using Proposition 11.1.5 again, a subset B of {V > b} with 
P2{B) > e~* , such that when {zn+i, ■ ■ ■ ,zn) £ B, the property similar to (11.4.7) 
holds. 

We now appeal to Corollary 2.4.5, to find (j/n+ij • • • , Un) £ A, (zn+i, ■ ■ ■ , zn) £ B 
such that if J = {i; n + 1 < i < N , Ui ^ Zi}, then 

(11.4.8) J29iyi)+9{zi)<Kt. 

ieJ 

We denote by F the collection of points that consists of the points {xi)i<n, 
together with the points yi, i ^ J. We denote by G the collection of points j/j, 
z e J. We have to show that 

(11.4.9) \L{F U G) - L{F) \<K{t + t^/^/n) . 

Let us denote by Se the collection of squares C E Ci that contain at least one 
point Vi, i e J, £{yi) = £. It follows from (11.4.1) that, if F C Fi C F U G, and if 
C & Si, we have 

|L(FiU(GnC)) -L(Fi)| < K2-^c&Td{{F U G) n B{C, K2-^)} . 

Thereby, adding to Fi all the points of fl G, where = U{C ; C e Se}, we 
cannot change the value of L by more than 

(11.4.10) 2-^card{(F U G) n B{Ue, K2-^)}) . 
Since by definition, for i{yi) = £ > ki, we have 

and since ^ g{yi) < Kt by (11.4.8) we see that 

card^^ < Kt2'^^-^''° . 

Now, B{Ui, K2~^) is contained in a union of < -ftTcardS'^ squares C of Ci. 
Thereby, it follows from (11.4.5), (11.4.7) that the quantity (11.4.10) is bounded 

by 

2-^K(N2-^^ri + re log ) < Kt2^-^° 

re 

and these quantities have a sum < Kt. 
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Now we have to control the influence of the points Ui for which l{yi) < ki. 

We denote by Vg the set {i{x) = £}. We recall that by Lemma 11.3.4 we have 
I U ^1 — Kt'^/N. Since Vi is union of squares of Ci, we have in particular that 

= for £ < ks, where 2-^=3 < Kt/VN. Adding to a set Fi such that F C Fi C 
F U G the points of 

G nVi, can, by (11.4.1), change the value of L by at most 

2-^card((F UG)nB(Vi, K2-^)) . 

Now we observe that \B{Vi, K2~^)\ < K\Vi\. Thus the total contribution of the 
points of G n V£ is bounded by 

(11.4.11) 2" Wd((F U G) n V) 

where \V\ < Kt'^/N and F is a union of squares of Cfc^-i. The summation of all 
these quantities over £ > ^3 is a most, using (11.4.7) 

Kf f p22^i-2\ 
K2-'=3card((F u G) n F) < ^ Ar2-2'=Vfc,_i + tu,-i log 

Kt^ ( 22^=1 ^ KN 
^ ^ 1 + ^ log 



But, using the definition of /ci, the last term is easily seen to be bounded by a 
constant. □ 

11.5. Simple matching 

In this section (for reasons that will become apparent later) we work in [0, l]'' 
for d>2. 

A matching of a set F is a decomposition of F as a union of disjoint pairs of 
points (points of the same pair are matched); we make the convention that when 
cardF is odd, there is exactly one point that is unmatched (does not belong to 
any pair). A minimum matching is a matching that minimizes the sum of the 
distances of pairs of matched points. We denote by L{F^ the length of a minimum 
matching of F. For simplicity, the point to which a given point is matched is called 
its partner. 

Our interest in that functional stems from the fact that it apparently does not 
have good regularity properties. It is obvious that 

L{Fy^{x))-L{Fy^{y)) < \\x-y\\, 

but in certain configurations this cannot be improved upon. The problem is that 
if one tries to match y to a point different from the partner of x, the partner of x 
has to find a new partner, etc., and there is no apparent way to control this chain 
reaction. 

While the behavior of F is not good as far as the change of one point of F is 
concerned, the situation is somewhat better when a significant number of points of 
F are changed. We set L'(F) = sup{L(F') ; F' C F}. 
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Lemma 11.5.1. |L(F) - L{G)\ < L'{FAG) + Vd. 

Proof. Consider U = F\G,V = G\F. Consider a minimal A4 matching of F, and, 
for a E F, denote its partner by 9{a). Consider 

H ^{ae F\U- e{a) G U} . 

When we remove U from F, the points of H lose their partners. Set H' = {6(a) ; a & 
H}. Thus H' C U. To find partners for the points of yu if we consider a minimum 
matching of V U H' . This matching induces a matching Ai' of V U H, using the 
bijection 9 of H and H' . The union of the trace of Ai on F\[U U H) and Ai' is 
almost a matching of G, although it could happen that there remains an unmatched 
point in FUiy and one in F\{U[JH). Then two points are then matched together 
(creating the term y/d) . The matching we have constructed witnesses that 

L{G) < L{F) + L{V UH') + Vd 
< L{F) + L\VUU) + Vd. 

To see it, it suffices to use the triangle inequality, and to observe that the edges 
[a, 9(a)] for a E U' do disappear from A4 when U is removed. □ 

Here is a simple observation. 

Lemma 11.5.2. Consider subsets Fi, . . .,Fp of [0, l]''. Then 

\i<p I i<p 

where, as in the rest of this section, K denotes a constant that depends on d only. 

Proof. It suffices to prove this for L rather than L' . The point is that if one 
considers an optimal matching of each Fj, their union fails to be a matching of 
(J Fj only because there could remain an unmatched point in each F^, while we 

are permitted at most a single unmatched point. Thus, it suffices to match all but 
at most one of these points, using for example a shortest tour through them, and 
matching consecutive points on the tour. □ 

It seems an interesting question whether when d = 2 the inequality of Theorem 
11.2.3 would hold, at least for smaller values of t. Possibly easier is the question 
whether the variance of Ljv is bounded. The best results in that direction belong to 
Rhee. She proved that if d = 2, VarLvv < K(\ogN)'^ [Rl], while if > 3, VarLjv < 
KN^~^/'^ [R2]. The arguments for these results are different. Our methods do not 
allow to improve on the result for d — 2, but allow significant improvement when 
d > 3 (and this is why we consider this case in this section). Although this has not 
been checked, it seems to be an exercise to show that VarLjv > ■^N^~'^^^ using 
e.g., the method of [R3]. What we wiU prove is that VaxLN < (logN)^ N^~'^/'^. 
The proof goes by first proving a Poissonized version of the result, and then using 
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"dePoissonization" . The second part of the argument is standard (see e.g. [Rl]) 
and will not be given here. 

The Poissonized version of the problems is the study of the r.v. Lx = L{Ilx), 
where Il\ is the random subset of [0, 1]'' that is generated by a Poisson point process 
of constant intensity A. We consider the space 0, of all finite subsets of [0, l]'^; and 
on O, we consider the probability F\ induced by n^. On il^, we consider the 
function 

(11.5.1) f{F,G) = L'{FAG). 

For a subset B of Q, we set 

(11.5.2) f{F,B) = mif{F,G) = inf L'(FAG) . 

We set 7 = 1 — 

Theorem 11.5.3. For all A > 3, all subsets B ofVl, we have 



j 



exp J^YJ. <^n(F) < ' 



{iog,\)'<\y '- Px(B) 

If we combine this result with Lemma 11.5.1 (and proceed as usual) we see that 
if M\ denotes a median of we have 



which certainly implies the previous claim about the variance of L\. To prove 
Theorem 11.5.3, we will prove the following statement, which form is adapted to 
proof by induction. 

Proposition 11.5.4. There exists numbers Kq, a > 1 depending on d only, such 
that for all q > we have, for all X, 1 < X < 2"^ and all Borel subsets B of il, 

To see that this statement implies Theorem 11.5.3, we take for q the smallest 
such that A < 2"^, so that is of order log A, and Kq of order (logA)^. 

The proof of Proposition 11.5.4 is by induction over q. For the case q = I, one 
uses the brutal bound 

f{F, G) < K{cavdF + cardG) 

and the exponential integrability of Poisson random variables. The easy details are 
left to the reader. 
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We will determine, in due time, suitable values for Kq and a and we now start the 
proof of the induction step from q to q+1. Consider A such that 2"*^ < A < 2"^^\ 
Consider the smallest integer n such that A' = X/n^ < 2"^. (Thus, we can apply 
the induction hypothesis to A'.) By definition of n, we have A/(n - 1)'^ > 2"' , so 
that, since A < 2"'^\ we have (n - 1)'^ < 2("-i)"', and thus 

(11.5.3) n"^ < 2("-^)"' < 2*^ • 2("-i^"' . 
Also, 

(11.5.4) A' = A > f " > ( " > 2"-'^ . 

Consider a partition of [0,1]'^ in n'' congruent cubes (Ci)i<^d. From Lemma 
11.5.2, we observe that 

(11.5.5) L\FAG) < L'{{FAG) n Q) + Kn"^-^ . 
We set 

fi{F,G) = L'{{FAG)r)Ci). 
Thus we have, from (11.5.5) 

L'{FAG) < J2 fiiF,G) + Kn'^-^ . 

Thus, if we set 

^(F,G)= mf^ Y,h{F,G) 

we get by (11.5.2) that 

(11.5.6) /(F, B) < g{F, B) + Kn''-^ . 



The crucial point is that {fl, Px) is naturally isomorphic to the product of n'^ 
copies of (O, Py)- To see this, let us denote by Ri an affine map from Cj to [0, 1]^, 
for i < n^. Then the isomorphism simply associates {Ri{F fl Ci))i<^nd to F. We 
observe that 

/i(F, G) = -L\Ri{F n Ci)ARi{G n d)) 
n 

so that, under this isomorphism, each function fi is distributed like the function h' 
on Q"^ (Provided with Py <S> Py), where h'{F,G) = ^L'(F A G). Moreover, with 
the notation of Definition (2.4.1), we have fh' = g- By induction hypothesis, and 
taking the scaling factor n into account, we have for each Borel set S C il, 

exp{2h{F,B))dPy{F) < 



Px'iB) 
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where h — ah', a — n{2KQ\' ^) ^. It then foUows from Theorem 2.5.1 and the 
definition of g that 

Vt < 1, / e^^{atg{F,B))dPx{F) < —^e^^{ZnH'') 

for each Borel set B C ft. From (11.5.6), it follows that 

/ exp(at/(F, B))dPx{F) < -|— exp(3n^t2 + Kn'^-^at) . 

We see that if 

(11.5.7) n'^/'^-^a<K, 
then, taking t = n~^l'^ jK, we get 

Now, 



a 



since d'j = d/2 — 1. Thus, provided Kq = 2K, (11.5.8) is exactly what we need to 
complete the induction. 

Thus, it remains to check that (11.5.7) holds; but by (11.5.3), (11.5.4) 

a/n <X'~^ < 2T"^2-^"' 
so that (11.5.7) holds for a = 1 + 27. □ 
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12. The free energy of Spin Glasses at high temperature 

Consider a sequence (ei)j<7v with G {—1, !}• Each represents the two 
possible values of the spin of particule i. Consider numbers {hij)i<i^j<N that 
represent the interaction between spins. The energy of a given configuration is 
given by hijeiSj. Consider a parameter /? > (that plays the role of the 

l<i<j<N 

inverse of the temperature). The so-called "partition function" is given by 

(12.1) Zjv = Z^(/iz,) = 2-^ J2 ^''P(^ ^^J^^^j) 

The role of the factor y/N is for normalization purposes that will become apparent 
later. 

If we think to as a Bernoulli r.v., it is natural to write 

(12.2) ZN{hij) = E,exp{^ ^ /i^,•e^e,■). 

In the model we study, the numbers hij are random, and the sequence (/iij)i<i<j<7v 
is i.i.d. We assume Ehij — Ehfj — 0, and we assume for normalization purposes 
that Ehfj = 1. We will also assume that E exp a\hij\ < oo for a small enough. 
Then EZ^ is well defined for N large enough. We are interested in the quantity 
N~^ElogZN (mean free energy per site), whose study relies ultimately on the 
study of Zn. It is proved in [A-L-R], and in [C-N] in the case where hij is gaussian, 
that for /3 < 1 the random variable logZ^ — (3'^N/4: converges in law to a (non- 
standard) normal r.v. Equally interesting, but of a rather different nature is the 
research of tail estimates for logZ^ — P'^N/A that are valid for all N. 

Theorem 12.1. There exists a universal constant K with the following property. 
Assume that Eexp±hij < 2. Then, forO<t< N/K, fi <l, 



(12.3) p(|logZ^ - ^1 > K{t+^log^—^)VN) < 2e-*^ 
In particular 

K I 2 1 /?2 K 

(12.4) log^- < -LElogZN - — < —. 

Comment. In the condition Eexp±hij < 2, the number 2 can be replaced by any 
other (with a different constant K). It seems reasonable to conjecture that (12.3) 
is not sharp in the gaussian case, and that, for a given (3 < 1, 




> t) = 0. 
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It should however be pointed out that (12.3) does not hold when the factor 
is removed from (12.3). Indeed it would follow otherwise that for each n, 
supE{4:Zn/ P'^ N)'^ < oo, and it is pointed out in [A-L-R], p. 6, that this is not the 



N 

case. 



The key to Theorem 12.1 will be the following deviation inequality 

(12.5) 0<t< 4^/NiN - 1) ^ P(| logZ^ - Mn\ >t)< 2exp ( g^^'^l^^ ) 

where Mn denotes a median of log Zn . We first show how to deduce this from 
Corollary 2.4.4. The second crucial step will then be to relate Mjv and 0^N/4: 
XogEZr,). 

To prove (12.5), we observe that 

(12.6) |logZ^^(/^y) -logZ^^(/^^,■)| < ^ J] \hij - 

l<i<j<N 

as follows from the fact that 



J2 (^i3^i^j\ < W 

^<i<j<N l<i<j<N 



^3 I 



We now view log Z function on MJ^^^ ^)/'^ _ "Wc wish to apply Corollary 

2.4.4, in the case $7 = M, h{x,y) = j\x — y\,iJ, the law of hij. We note that (2.4.12) 
holds, since 

JJ exp - \x — y\diJ,{x)diJ,{y) — (^J exp -|a;|(i^(a;)j 

< (Eexp|/ii,|)'/' 

< [E{exphij + exp —hij))^^"^ < 2. 

Consider now v and the set A = {log^Ar < v}. Combining (12.6) and (2.4.13) 
(used for N{N — l)/2 rather than N) we see that for u > v, we have 

u-v< Af3\/N{N - 1) 

^ P({logZ^ > «})P({logZ^ < v}) < exp ( - ^A__A_) 

Taking successively u = and v = Mn, (12.5) follows as usual. 

In order to relate Mn and P'^N/A, the key step is the elementary estimates 

1 d'^N 
(12.7) — exp < EZn < K exp — — 

K A 4 



(12.8) EZ%<,^^{EZNf. 
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These will be proved later. First, we conclude the main argument. Consider the 
set A = {Zn > \EZn}. Then 

EZp^ = E{Zn1ao) + E{Zn1a) 

< lEZN + E{Zj,y/^P{A)^/^ 

so that 

^^^^-4 EZl 

(a fact going back to Paley and Zigmund.) Combining with (12.8), we get -P(A) > 
(1 — To get a lower bound for M^r, we can assume < \o^\EZf^. We set 
t = log(i£;ZAr) - Mat. Since log Z^ > 0, we have Mn > and hence t<K + N/A 
by (12.7). 

We certainly have 

A C {log Zn>Mn + t] 

Thus, by (12.5) we have 

^^}—J^ < P(A) < 2exp ( -) 

K .1/2 



so that 



and thus 



t<KViV(log^) 



Mat > log {\eZn) - kVn {log j^)'^"- 
We also have Mjv < log(2£^Zjv). Combining with (12.7) we get 

SO that (12.3) now follows from (12.5). 

To prove (12.4), we first observe that the lower bound follows from (12.3) and 
a routine computation. The upper bound follows from the concavity of log, which 
implies E'logZjv < log-E^Ar, and (12.7). 

It remains to prove (12.7), (12.8). We start with the elementary inequality 

9 H 4 

/-n-^ nn'-' 

' 2 3! ' ~ 4! 



that is obvious on power series expansions. Thus, for \u\ < i, we have (since 
Ehl = 1, Ehij = Eh% = 0), 



(12.9) 1 + ^ ~ ^ Eexp{uhij) < 1 + — + Ku^ 
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and thus, for e = ±1, (3 < 1, 



Since 

EZiv = exp Sie ^^'^ 



13 



(12.7) follows. Turning to the study of EZ%, we have, using (12.9), and for N >8, 
that, with obvious notations. 



l<i<j<N ^ 

<KE,E,,exp(^ ^ ^(e,e,.+e^e;.)') 

i<i<i<Ar 

Now, we have (e^e^ +e^e^)^ = 2 + 2ejeje^e^-. Also, eje^e^e^- is distributed like eje^, so 
that 

EZ^<Kexp(^)£;,(|^( J2 2e.e,)) 

l<i<j<Ar 

Now, 

E 2e,6, = ( ^ 6,)' -AT. 

^<i<j<N l<i<N 

Using the subgaussian inequality 

t 



N .2 



i=l 

we have 



= 2/?V(l-/5') 

so that (12.8) follows. □ 
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13. Sums of (vector valued) independent random variables 

The first objective of ttiis section is to discuss tfie genesis of some key ideas of the 
isoperimetric approach. This wiU be helped by a simple (but rather typical) example 
of application of Theorem 3.1.1. We will then discuss, in detail, a situation that 
parallels the situation of Chapter 8, but where the infimum over a E Tis replaced by 
a supremum. There are unexpected and subtle differences; this is closely connected 
to the fact that the conditions on the function h{x,y) in Theorem 4.4.1 are (and 
must be) highly disymmetric in x and y. 

Consider a family of A'^-tuples a — {ai)i<N, cti > 0; and for y — {yi)i<N set 

(13.1) Z{y) = sup ^ aiyi . 

Consider a sequence (^i)i<Ar of positive independent r.v. Consider the r.v. 

(13.2) Z = Z^ = Z(Xi,...,Xjv) = sup ^ttiXi. 



We denote by X* the non- decreasing rearrangement of the sequence (Xj)j<jv. 
That is, 

(13.3) X* = sup{t ; card{j < N ; Xj > t} > i} . 

It is useful to note that 



X* = sup < ^ Xi ; card/ = k> 

i<k Kiel J 



The key motivation for Theorem 3.1.1 is the following, where r = supjcij ; i < 
Proposition 13.1. Consider a > 0, g, /c e N. Then 



(13.4) P{Z > qa + t) < ^-—^ + P { t^" X* > t 

^ ' \ -H ^ J - qk+ip(^z <a)i \ 



Comment. To make this inequality useful, one has to estimate the last term, and 
then to choose a, qf. A; in a efficient way. 

Proof. Set VL = M+. Consider the set 

A = ^(a) = {yel7^;Z(y)<a} 

where Z{\j) is given by (13.1). 
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Consider J C {1, . . . , N}, and a & J-'. A key observation is that, by positivity 
(13.5) ^ aiUi < ^ QiUi . 

i€J i<N 

Consider now , . . . , y'^ E A, x E , and set 

I={i<N;Xi^{yl...,yn}. 



Consider a partition {Jj)j<q of {1, ... , N}\J such that if z e Jj, then Xi 
Then, for a e J^, by (13.5), since y^ e A 

^ aiXi = ^ ^ (^iVl < go 

i^I j<q i€Jj 



Vi- 



and thus 



Hence (with the notation of Section 3.1.1) if f{A, . . . , A,x) < k, we have 
(13.6) ^Xi<qa + T^x*. 



i<N 



i<k 



If we provide the i^^ factor of with the law of Xi, then (13.4) foUows from 
(3.1.3) and (13.6). □ 

Remark. Certainly one can bound the term ^ ctjXj in a less brutal way, by 

sup Yl Cti^i- 

a€ J^,card/=fc i^I 

Consider now a sequence (Xi)i<jv of Banach space valued r.v. A number of 
classical problems of probability (in particular, laws of large numbers and laws of 
the iterated logarithm) depend crucially on sharp estimates of the tail probability 
S ^ill — ^)- many years these estimates were found using martingales, 

i<N 

and the results were not optimal. One big obstacle is that there is no obvious sub- 
stitute for the positivity arguments that are central to Chapter 8 and to Proposition 
13.1. Although its importance became clear only later, a crucial contribution was 
made by M. Ledoux [L-1]. It was known at the time that in many situations, the 



tails of 



E X, 

i<N 



resemble the tails of 



i<N 



where {gi)i<N is an independent 



sequence of standard normal r.v. that is independent of the sequence X^. To study 



E 9iXi 



i<N 



Ledoux wrote 



(13.7) 



+ 



i<N 



i<N 



i<N 



i<N 
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where Eg denotes conditional expectation given {Xi)i<N. The idea was that either 
term of the right-hand side should be easier to study than the term of the left-hand 
side. This is particularly apparent for the second term, where, arguing conditionally 
on Xi, one can take advantage of the properties of Gaussian processes. 

It turns out that the first term in the right of (13.7) has the exact property needed 
to replace the positivity used in Proposition 13.1; namely, if J C {1, ... , N}, we 
have 



(13.8) 



ieJ 



< E, 



i<N 



The realization of the importance of positivity-like properties led first to the char- 
acterization of the Banach-space valued r.v. that satisfy the law of the iterated 
logarithm [L-Tl]. Perhaps more importantly, (13.8) lead this author to the belief 
that some isoperimetric principle should be relevant, and hence to the theorem of 
[T2] (that is now superceeded by the comparable, but much easier to prove The- 
orem 3.1.1), and started the line of investigation that culminates in the present 
paper. 

The author also understood that Bernoulli r.v. have regularity properties that 
almost match those of Gaussian r.v. (a crucial step is the comparison theorem 

of [T6]). They offer the extra advantage that the tails of J2i<N ^i-^'i (where 

P{ei = —1) = P{ei = 1) = 1/2) always resemble the tails of 
rather than (13.8) one should write 



<N 



. Thus, 



(13.9) 








< 








) 




i<N 




i<N 




i<N 




i<N 





To study the last term conditionally on (Xj)j<iv, one can rely, in particular, 
upon the following result. 

Theorem 13.2. Consider vectors (fi)i<i<Ar in a Banach space W, and set 

I \ 

(13.10) a= I sup I ^t(;*(vi)2:t(;* e W* , \\w*\\ < 1 

\ (i<N 

Consider a sequence {Xi)i<N of independent real valued r.v. such that \Xi\ < 1. 



Denote by M a median of the r. v. 



i<N 



Then for t > we have 



(13.11) 



i<N 



M 



>ta\ < 4 exp 



16 
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Proof. We observe that if we set 

= {(w*{vi)) ;w* eW* , \\w*\\ < 1} 

then 



Z = 



i<N 



sup 



i<N 



Thus Theorem 13.2 is a special case of Theorem 8.1.1 (using scahng). 



□ 



Remarks. 1) Certainly the constant in the exponent is not sharp, and could be 
improved using (4.2.7) rather than (4.1.3), especially in the case of Bernoulli r.v., 
where the use of (4.3.8) would yield a bound of exp — |(t — ^/\og2)'^ for t > ^/\og2. 



2) There is another bound on the tails of 



i<N 



namely the trivial bound 



XiVi < sup E \w*{vi)\, and it is, of course, possible to interpolate be- 

i<N l|w*ll<l i<N 

tween this bound and (13.11). This can be done as follows. For a sequence {ri)<N 
of real numbers, and t > we write 

-P^i,2((ri), t) = inf <j + t wH ; n = Ui + Wi 



i<N 



,i<N 



where the infimum is taken over all possible decompositions = Ui + Wi. We set 

K{t) = svip{Ki^2{w*{vi),t);w* e W*, \\w*\\ = 1}. 



We observe that K{t) < ta. Only rather trivial modifications to the proof of 
Theorem 8.1.1 are needed to see that one can improve (13.11) into 



(13.12) 



i<N 



M 



> K{t) < 2exp 



16 



This inequality streamlines a result of [D-MS] . 

If one observes that K{2t) < 2K{t) one obtains through a routine computation 
that, for all p > 1 



i<N 



<M + Kk{^) , 



a rather precise form of the so called Kintchin-Kahane inequalities. It should also 



be pointed out that by (13.12), 



i<N 



> M - Kk{^), that 



i<N 



> 
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M2-^/f (obviously) and that 



i<N 



> Kk{^Jp/K). To prove this last in- 



equality, one reduces to the real-valued case; it is simple to see that this follows 
from [L-Tl] lemma 4.9 (see also [M-S]). 

We will not pursue the discussion of how to build upon (13.4), Theorems 3.1.1 



and 13.2 to obtain sharp bounds for the tails of 



E X, 

i<N 



This is done in great 



detail in [L-T2], Theorem 6.17. An alternative approach, that relies rather on 
Theorem 4.2.4 is developed in [T3]. The bounds obtained through these methods 
apparently are sharp in all situations (see however, [Ro] for a case where other 
ingredients are needed). Instead, we continue the investigation of r.v. of the type 
Z — sup E cui^i that was started in Chapter 8. In order to apply Corollary 8.2.2, 

we need to have (4.4.6), where h{x, y) = \x — y\, or, if is always positive, h{x, y) = 
{x — y)~^. When the variable Xi is positive (i.e., its law /i is supported by M"'"), 
inspection of Theorem 4.4.1 shows that (whatever choice of 6, ^) no integrability 
condition on Xi except boundedness, will insure that the conditions of this theorem 
hold for this choice of h. We will now give an example that shows that this is not an 
artifact of our approach. We will show that (13.11) cannot be essentially improved, 
even if P(|Xi| 0) is arbitrary small. This implies (by scaling) that, given any 
finite function — > 1R+, with $(0) = 0, one can find a real r.v. xi with 
/ ^{\Xi\)dX < 1, and vectors {vi)i<N such that (13.11) is violated. 

Example 13.3. This example is essentially a re-interpretation of the example pre- 
sented at the end of Section 4.3. Consider an independent sequence (Xi)i<Af of 
Bernoulli variables such that P{Xi = 1) = p is small. Consider the family !F of 
A'"-tuples of form ctj = l/\/2pN if « e /, ccj = otherwise, where / varies over all 
subsets of {1, . . . , N} of cardinality < 2pN . Then cr = 1. Consider 



sup 

aeJ^ 



aiXi = , ^ sup Xi] card/ < 2Np } 



i<N 



We can also view Z as 
where the norm || • is given by 

The main observation is that 

^Xi< 2Np =^ Z 

i<N 



where Cj is the canonical basis of M^, and 



x\\jr = sup E QJilajj 

aeJ^ i<N 



i<N 



Since the probability of the event on the left goes to 1, as N ^ oo, the r.v. Z is 
asymptotically normal, of mean y/Np/2 and variance ^/(l — p)/2; so its deviation 
from its median do not decay faster than exp —Kt^. 

The conclusion to be drawn from Example 13.3 is that, in order to extend The- 
orem 13.2 to the case where Xi is unbounded, we must require conditions of a 
different nature than integrability. 
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Theorem 13.4. There exists a universal constant L with the following property. 
Consider a convex function on R""" such that il^^x) < x^ if x <1 and il:{x) > x if 
X >1. Consider a probability measure ji on IR. Assume the following 

(13.13) Vt>0, ii{{x; \x\>t}) <2exp{-LiJj{2t)) . 
Given any subset B ofM., with fJ>{B) > 1/2, and any t > 1, we have 

(13.14) fx{{x ; V( inf \x - y\) > t}) < e-\l - fi{B)) . 



Consider independent real valued r.v. (Xj)j<iv distributed like ji, and vectors 
{vi)i<N is a Banach space W. 

Then, for all t > 0, we have 



(13.15) 



i<N 



M 



>t \ < 2exp ( --^:p{t) 



where M is a median of 



E XiVi 

i<N 



where 



J'^{{w*{vi))i<N;w* e W\ \\w*\\ < 1} 
and where is defined in Section 8.2. 

Proof. According to CoroUary 8.2.2, it suffices to prove that the hypothesis of The- 
orem 4.4.1 hold when h{x,y) = \x — y\, in the case 9{x) = — logx, w{x) = —^logx 
(so that H{^,w) holds by Proposition 2.6.1). Only (4.4.4) has to be checked, since 
(13.14) is a rewriting of (4.4.5). 

Consider B cR with fi{B) < 1/2. Consider 
(13.16) s = mi{\y\;yeB}. 

Clearly, h{x,B) < \x\ -\- s. Thus, by convexity of ij) we have 

/ exp'^(/j,(a;, S))(i//(a;) < exp -7/;(2s) / exp -il){2x)dii{x) . 
Jm 2 2 

On the other hand, by (13.16) we have Bd] — s,s[= 0, so that -B C {a;; |a;| > s} 
and hence by (13.13) we have exp V'(2s) < {2/^i{B)f/^. Thus it suffices to show 
that for L large enough we have 



1 . ^2 
a; < - ^/ - 
- 2 \x 



1/2L 



< —= (= exp —to (a;)) 

IX 



where / = /^^ exp i?/'(2a;)(i//(a;). Thus it remains to show that under (13.13) 



lim / = 0, uniformly in -0, an easy exercise left to the reader. 



□ 
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Theorem 13.4 can be applied to the case where is a measure Z/'^ of the type 
considered in Proposition 2.7.4, although in that case the simpler Theorem 2.7.1 
will yield the same conclusion. There are however, situations covered by Theorem 
13.4 that are not covered by Theorem 2.7.1, because in (13.14) we require only 
t > 1. In particular, if the law of X satisfies (13.14), and if ||.Z^||oo ^ 1? the law 
of 1{X + Z) satisfies (13.14) (it is not required that Z be independent of X). 
(The corresponding statement for (13.13) is also true, under mild conditions on ip, 
replacing if needed 1/3 by a smaller number.) 

In conclusion of this section, we want to discuss a question that apparently is 
not fully clarified by the results of the present paper. Consider numbers (ai)j<Ar, 
and vectors {vi)i<N in a Banach space. Of which order are the fluctuations of the 



r.v. Z = 



around its median M, when p is seen as a random element 



i<N 

of the symmetric group Sn, provided with the uniform probability PI 
Proposition 13.5. a) Assuming |ai| < 1 for each i, we have 



(13.17) 
where as usual 



t>Q^P{\Z-M\ >t) <4exp 



16(72 



fT^ = sup i ^ W*{Vif -^w* & W* , \\w*\\ < 1 



i<N 



b) Assuming \\vi\\ < 1 for each i, we have 
(13.18) t >O^P(|Z-M| >t) <4exp 



16 E a: 



i<N 



2 • 

i 



Remark. A first problem is to find a bound that contains simultaneously (13.17) 
and (13.18). 

Proof. The proof follows that of Theorem 8.1.1, using now Theorem 5.1 rather than 
Theorem 4.1.1. Thus, we indicate only the key points. 

To prove a), one notes that for p, r e S'jv, and I = {i < N ; p{i) ^ T(i)}, then 



i<N 



i<N 



iei 



To prove b), one observes that Z has the same distribution as 
with the notations above, one now has 



i<N 



; and. 



i<N 



i<N 



< 



iei 



□ 
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It should be pointed out that it is most hkely that a phenomenon similar to that 
of Example 13.3 occurs in case a, and that (13.16) cannot be improved even if a 
large majority of the numbers are equal to zero. 
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